|
Math @ Duke
|
Publications [#382823] of Rong Ge
Papers Published
- Chen, Z; Ge, R, Mean-Field Analysis for Learning Subspace-Sparse Polynomials with Gaussian Input,
Advances in Neural Information Processing Systems, vol. 37
(January, 2024)
(last updated on 2026/01/18)
Abstract: In this work, we study the mean-field flow for learning subspace-sparse polynomials using stochastic gradient descent and two-layer neural networks, where the input distribution is standard Gaussian and the output only depends on the projection of the input onto a low-dimensional subspace. We establish a necessary condition for SGD-learnability, involving both the characteristics of the target function and the expressiveness of the activation function. In addition, we prove that the condition is almost sufficient, in the sense that a condition slightly stronger than the necessary condition can guarantee the exponential decay of the loss functional to zero.
|
|
|
|
dept@math.duke.edu
ph: 919.660.2800
fax: 919.660.2821
| |
Mathematics Department
Duke University, Box 90320
Durham, NC 27708-0320
|
|