Abstract
In this note, we consider the complexity of optimizing a highly smooth (Lipschitz k-th order derivative) and strongly convex function, via calls to a k-th order oracle which returns the value and first k derivatives of the function at a given point, and where the dimension is unrestricted. Extending the techniques introduced in Arjevani et al. [2019], we prove that the worst-case oracle complexity for any fixed k to optimize the function up to accuracy ϵ is on the order of (μkDk−1λ)23k+1+loglog(1ϵ) (in sufficiently high dimension, and up to log factors independent of ϵ), where μk is the Lipschitz constant of the k-th derivative, D is the initial distance to the optimum, and λ is the strong convexity parameter.
| Original language | English |
|---|---|
| Number of pages | 21 |
| Journal | arxiv.org |
| State | Published - 13 Oct 2020 |
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver