Abstract
Approachability has become a standard tool in analyzing learning algorithms in the adversarial online learning setup. We develop a variant of approachability for games where there is ambiguity in the obtained reward: it belongs to a set rather than being a single vector. Using this variant we tackle the problem of approachability in games with partial monitoring and develop a simple and generally efficient strategy (i.e., with constant per-step complexity) for this setup. As an important example, we instantiate our general strategy to the case when external regret or internal regret is to be minimized under partial monitoring.
Original language | English |
---|---|
Pages (from-to) | 3247-3295 |
Number of pages | 49 |
Journal | Journal of Machine Learning Research |
Volume | 15 |
State | Published - 1 Oct 2014 |
Keywords
- Approachability
- Online learning
- Partial monitoring
- Regret
All Science Journal Classification (ASJC) codes
- Software
- Control and Systems Engineering
- Statistics and Probability
- Artificial Intelligence