Lower bounds for finding stationary points I

Yair Carmon, John C. Duchi, Oliver Hinder, Aaron Sidford

Research output: Contribution to journalArticlepeer-review

Abstract

We prove lower bounds on the complexity of finding ϵ-stationary points (points x such that ‖ ∇ f(x) ‖ ≤ ϵ) of smooth, high-dimensional, and potentially non-convex functions f. We consider oracle-based complexity measures, where an algorithm is given access to the value and all derivatives of f at a query point x. We show that for any (potentially randomized) algorithm A, there exists a function f with Lipschitz pth order derivatives such that A requires at least ϵ-(p+1)/p queries to find an ϵ-stationary point. Our lower bounds are sharp to within constants, and they show that gradient descent, cubic-regularized Newton’s method, and generalized pth order regularization are worst-case optimal within their natural function classes.

Original languageEnglish
Pages (from-to)71-120
Number of pages50
JournalMathematical Programming
Volume184
Issue number1-2
DOIs
StatePublished - 1 Nov 2020
Externally publishedYes

Keywords

  • Cubic regularization of Newton’s method
  • Dimension-free rates
  • Gradient descent
  • Information-based complexity
  • Non-convex optimization

All Science Journal Classification (ASJC) codes

  • Software
  • General Mathematics

Fingerprint

Dive into the research topics of 'Lower bounds for finding stationary points I'. Together they form a unique fingerprint.

Cite this