TY - JOUR
T1 - Distributed learning in congested environments with partial information
AU - Leshem, Amir
AU - Krishnamurthy, Vikram
AU - Boyarski, Tomer
N1 - Publisher Copyright:
© 2024 Elsevier Ltd
PY - 2024/11
Y1 - 2024/11
N2 - How can non-communicating agents learn to share congested resources efficiently? This is a challenging task when the agents can access the same resource simultaneously (in contrast to multi-agent multi-armed bandit problems) and the resource valuations differ among agents. We present a fully distributed algorithm for learning to share in congested environments and prove that the agents’ regret with respect to the optimal allocation is poly-logarithmic in the time horizon. Performance in the non-asymptotic regime is illustrated in numerical simulations. The distributed algorithm has applications in cloud computing and spectrum sharing.
AB - How can non-communicating agents learn to share congested resources efficiently? This is a challenging task when the agents can access the same resource simultaneously (in contrast to multi-agent multi-armed bandit problems) and the resource valuations differ among agents. We present a fully distributed algorithm for learning to share in congested environments and prove that the agents’ regret with respect to the optimal allocation is poly-logarithmic in the time horizon. Performance in the non-asymptotic regime is illustrated in numerical simulations. The distributed algorithm has applications in cloud computing and spectrum sharing.
KW - Congestion games
KW - Distributed learning
KW - Learning in dense environments
KW - Learning in games
KW - Poly-logarithmic regret
UR - http://www.scopus.com/inward/record.url?scp=85200581270&partnerID=8YFLogxK
U2 - 10.1016/j.automatica.2024.111817
DO - 10.1016/j.automatica.2024.111817
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85200581270
SN - 0005-1098
VL - 169
JO - Automatica
JF - Automatica
M1 - 111817
ER -