How do infinite width bounded norm networks look in function space?

Pedro Savarese, Itay Evron, Daniel Soudry, Nathan Srebro

Research output: Contribution to journalConference articlepeer-review

Abstract

We consider the question of what functions can be captured by ReLU networks with an unbounded number of units (infinite width), but where the overall network Euclidean norm (sum of squares of all weights in the system, except for an unregularized bias term for each unit) is bounded; or equivalently what is the minimal norm required to approximate a given function. For functions f : R → R and a single hidden layer, we show that the minimal network norm for representing f is max(´ |f00(x)| dx, |f0(-∞) + f0(+∞)|), and hence the minimal norm fit for a sample is given by a linear spline interpolation.

Original languageEnglish
Pages (from-to)2667-2690
Number of pages24
JournalProceedings of Machine Learning Research
Volume99
StatePublished - 2019
Event32nd Conference on Learning Theory, COLT 2019 - Phoenix, United States
Duration: 25 Jun 201928 Jun 2019
https://proceedings.mlr.press/v99

All Science Journal Classification (ASJC) codes

  • Software
  • Artificial Intelligence
  • Control and Systems Engineering
  • Statistics and Probability

Fingerprint

Dive into the research topics of 'How do infinite width bounded norm networks look in function space?'. Together they form a unique fingerprint.

Cite this