Abstract
We revisit the problem of online linear optimization in the case where the set of feasible actions is accessible through an approximated linear optimization oracle with a factor α multiplicative approximation guarantee. This setting in particular is interesting because it captures natural online extensions of well-studied offline linear optimization problems that are NP-hard yet admit efficient approximation algorithms. The goal here is to minimize the α-regret, which is the natural extension to this setting of the standard regret in online learning. We present new algorithms with significantly improved oracle complexity for both the full-information and bandit variants of the problem. Mainly, for both variants, we present α-regret bounds of O(T−1/3), were T is the number of prediction rounds, using only O(log T) calls to the approximation oracle per iteration, on average. These are the first results to obtain both the average oracle complexity of O(log T) (or even polylogarithmic in T) and α -regret bound O(T−c) for a constant c > 0 for both variants.
| Original language | English |
|---|---|
| Article number | 1 |
| Pages (from-to) | 204-220 |
| Number of pages | 17 |
| Journal | Mathematics of Operations Research |
| Volume | 46 |
| Issue number | 1 |
| DOIs | |
| State | Published - Feb 2021 |
Keywords
- Approximation algorithms
- Online algorithms
- Online learning
- Online linear optimization
- Regret minimization
All Science Journal Classification (ASJC) codes
- Computer Science Applications
- General Mathematics
- Management Science and Operations Research
Fingerprint
Dive into the research topics of 'Efficient online linear optimization with approximation algorithms'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver