Multi-player bandits: The adversarial case

Pragnya Alatur, Kfir Y. Levy, Andreas Krause

Research output: Contribution to journal › Article › peer-review

Abstract

We consider a setting where multiple players sequentially choose among a common set of actions (arms). Motivated by an application to cognitive radio networks, we assume that players incur a loss upon colliding, and that communication between players is not possible. Existing approaches assume that the system is stationary. Yet this assumption is often violated in practice, e.g., due to signal strength fluctuations. In this work, we design the first multi-player bandit algorithm that provably works in arbitrarily changing environments, where the losses of the arms may even be chosen by an adversary. This resolves an open problem posed by Rosenski et al. (2016).

Original language: English
Journal: Journal of Machine Learning Research
Volume: 21
State: Published - 1 Apr 2020

Keywords

  • Cognitive Radio Networks
  • Multi-Armed Bandits
  • Multi-Player Problems
  • Online Learning
  • Sequential Decision Making

All Science Journal Classification (ASJC) codes

  • Software
  • Control and Systems Engineering
  • Statistics and Probability
  • Artificial Intelligence
