TY - JOUR
T1 - Sequential multi-agent exploration for a common goal
AU - Rochlin, Igor
AU - Sarne, David
AU - Zussman, Gil
PY - 2013
Y1 - 2013
N2 - Motivated by applications in Dynamic Spectrum Access Networks, we focus on a system in which a few agents are engaged in a costly individual exploration process where each agent's benefit is determined according to the minimum obtained value. Such an exploration pattern is applicable to many systems, including shipment and travel planning. This paper formally introduces and analyzes a sequential variant of the general model. According to that variant, only a single agent engages in exploration at any given time, and when an agent initiates its exploration, it has complete information about the minimum value obtained by the other agents so far. We prove that the exploration strategy of each agent, according to the equilibrium of the resulting Stackelberg game, is reservation-value based, and show how the reservation values can be calculated. We also analyze the agents' expected-benefit maximizing exploration strategies when they are fully cooperative (i.e., when they aim to maximize the expected joint benefit). The equilibrium strategies and the expected benefit of each agent are illustrated using a synthetic homogeneous environment, thereby demonstrating the properties of this new exploration scheme and the benefits of cooperation.
KW - Cooperation
KW - Dynamic spectrum access networks
KW - Game theory
KW - Multi-agent exploration
KW - Multilateral search
UR - http://www.scopus.com/inward/record.url?scp=84897996954&partnerID=8YFLogxK
U2 - 10.3233/wia-130272
DO - 10.3233/wia-130272
M3 - Article
AN - SCOPUS:84897996954
SN - 1570-1263
VL - 11
SP - 221
EP - 244
JO - Web Intelligence and Agent Systems
JF - Web Intelligence and Agent Systems
IS - 3
ER -