Abstract
We consider the question of which functions can be captured by ReLU networks with an unbounded number of units (infinite width), but where the overall network Euclidean norm (sum of squares of all weights in the system, except for an unregularized bias term for each unit) is bounded; or, equivalently, what the minimal norm required to approximate a given function is. For functions f : ℝ → ℝ and a single hidden layer, we show that the minimal network norm for representing f is max(∫ |f″(x)| dx, |f′(−∞) + f′(+∞)|), and hence the minimal-norm fit for a sample is given by a linear spline interpolation.
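To make the main identity concrete, here is a small numerical sketch (not from the paper; the knots and slopes are arbitrary illustrative choices). It builds a piecewise-linear f as a sum of ReLU units a_i · relu(x − t_i), uses the balanced factorization |w1_i| = |w2_i| = √|a_i|, and checks that the resulting network norm ½ Σ (w1_i² + w2_i²) = Σ |a_i| matches the total variation of f′, i.e. the ∫ |f″(x)| dx term; since f′(−∞) = 0 here, the max in the theorem reduces to that term.

```python
import numpy as np

rng = np.random.default_rng(0)
knots = np.sort(rng.uniform(-1.0, 1.0, 5))   # kink locations t_i (illustrative)
slopes = rng.normal(size=5)                  # slope increments a_i at each knot

# Balanced weights: w2_i = sign(a_i)*sqrt(|a_i|), w1_i = sqrt(|a_i|),
# so that w2_i * w1_i = a_i and each unit contributes |a_i| to the norm.
w1 = np.sqrt(np.abs(slopes))
w2 = np.sign(slopes) * np.sqrt(np.abs(slopes))

def f(x):
    """One-hidden-layer ReLU network: sum_i w2_i * relu(w1_i * (x - t_i))."""
    return np.sum(w2 * np.maximum(w1 * (x - knots), 0.0))

# Network norm: half the sum of squares of all (non-bias) weights.
network_norm = 0.5 * np.sum(w1**2 + w2**2)

# For a linear spline, integral of |f''| is the total variation of f',
# i.e. the sum of the absolute slope jumps at the knots.
tv_fprime = np.sum(np.abs(slopes))

# The theorem's value: max(integral of |f''|, |f'(-inf) + f'(+inf)|),
# with f'(-inf) = 0 and f'(+inf) = sum_i a_i for this construction.
claimed_min_norm = max(tv_fprime, abs(np.sum(slopes)))

print(network_norm, tv_fprime, claimed_min_norm)  # all three coincide here
```

Since Σ |a_i| ≥ |Σ a_i|, the max is attained by the ∫ |f″| term for this construction, so the balanced network above achieves exactly the claimed minimal norm.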
| Original language | English |
| --- | --- |
| Pages (from-to) | 2667-2690 |
| Number of pages | 24 |
| Journal | Proceedings of Machine Learning Research |
| Volume | 99 |
| State | Published - 2019 |
| Event | 32nd Conference on Learning Theory (COLT 2019), Phoenix, United States, 25-28 Jun 2019, https://proceedings.mlr.press/v99 |
All Science Journal Classification (ASJC) codes
- Software
- Artificial Intelligence
- Control and Systems Engineering
- Statistics and Probability