An optimal algorithm for bandit and zero-order convex optimization with two-point feedback

Research output: Contribution to journalArticlepeer-review

Abstract

We consider the closely related problems of bandit convex optimization with two-point feedback, and zero-order stochastic convex optimization with two function evaluations per round. We provide a simple algorithm and analysis which is optimal for convex Lipschitz functions. This improves on Duchi et al. (2015), which only provides an optimal result for smooth functions; Moreover, the algorithm and analysis are simpler, and readily extend to non-Euclidean problems. The algorithm is based on a small but surprisingly powerful modification of the gradient estimator.

Original languageEnglish
Pages (from-to)1-11
Number of pages11
JournalJournal of Machine Learning Research
Volume18
StatePublished - 1 May 2017

All Science Journal Classification (ASJC) codes

  • Software
  • Artificial Intelligence
  • Control and Systems Engineering
  • Statistics and Probability

Fingerprint

Dive into the research topics of 'An optimal algorithm for bandit and zero-order convex optimization with two-point feedback'. Together they form a unique fingerprint.

Cite this