Abstract
Channel allocation is the task of assigning channels to users such that some objective (e.g., sum-rate) is maximized. In centralized networks such as cellular networks, this task is carried by the base station (BS) which gathers the channel state information (CSI) from the users and computes the optimal solution. In distributed networks such as ad-hoc and device-to-device (D2D) networks, no BS exists and conveying global CSI between users is costly or simply impractical. When the CSI is time varying and unknown to the users, the users face the challenge of both learning the channel statistics online and converging to a good channel allocation. This introduces a multi-armed bandit (MAB) scenario with multiple decision makers. If two or more users choose the same channel, a collision occurs and they all receive zero reward. We propose a distributed channel allocation algorithm that each user runs and converges to the optimal allocation while achieving an order optimal regret of O log T , where T denotes the length of time horizon. The algorithm is based on a carrier sensing multiple access (CSMA) implementation of the distributed auction algorithm. It does not require any exchange of information between users. Users need only to observe a single channel at a time and sense if there is a transmission on that channel, without decoding the transmissions or identifying the transmitting users. We demonstrate the performance of our algorithm using simulated LTE and 5G channels.
Original language | English |
---|---|
Article number | 8792155 |
Pages (from-to) | 2337-2349 |
Number of pages | 13 |
Journal | IEEE Journal on Selected Areas in Communications |
Volume | 37 |
Issue number | 10 |
DOIs | |
State | Published - Oct 2019 |
Bibliographical note
Publisher Copyright:© 1983-2012 IEEE.
Funding
Manuscript received December 15, 2018; revised April 5, 2019; accepted May 20, 2019. Date of publication August 8, 2019; date of current version September 16, 2019. This work was supported in part by the ISF-NRF Joint Research Program under Grant ISF 2277/16 and Grant ISF 1644/18, in part by WASP/NTU M4082187 under Grant 4080, in part by the Singapore MOE Tier 1 under Grant 2017-T1-002-007 RG122/17, and in part by NRF2015-NRF-ISF001-2277. This article was presented at the 20th IEEE International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), Cannes, France, July 2019. The work of S. M. Zafaruddin was supported in part by the Israeli Planning and Budget Committee (PBC) Post-Doctoral Fellowship (2016–2018). (Corresponding author: S. M. Zafaruddin.) S. M. Zafaruddin was with the Faculty of Engineering, Bar-Ilan University, Ramat Gan 5290002, Israel. He is now with the Department of Electrical and Electronics Engineering, Birla Institute of Technology and Science at Pilani, Pilani 333031, India (e-mail: [email protected]).
Funders | Funder number |
---|---|
ISF-NRF | ISF 1644/18, ISF 2277/16 |
Israeli Planning and Budget Committee | |
WASP/NTU | 4080, M4082187 |
Ministry of Education - Singapore | 2017-T1-002-007 RG122/17, NRF2015-NRF-ISF001-2277 |
Planning and Budgeting Committee of the Council for Higher Education of Israel |
Keywords
- Distributed channel allocation
- dynamic spectrum accesses
- multiplayer multi-armed bandit
- online learning
- resource management