leiwu0 / sgd.stability
Analyze the dynamic stability of SGD
☆12Updated 6 years ago
Alternatives and similar repositories for sgd.stability:
Users who are interested in sgd.stability are comparing it to the repositories listed below
- Convolutional Neural Tangent Kernel☆111Updated 5 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆106Updated 4 years ago
- NTK reading group☆88Updated 5 years ago
- ☆58Updated 2 years ago
- ☆36Updated 3 years ago
- ☆15Updated 5 years ago
- The codebase for the paper "A Tail-Index Analysis of Stochastic Gradient Noise in Deep Neural Networks"☆25Updated 5 years ago
- Code for Self-Tuning Networks (ICLR 2019) https://arxiv.org/abs/1903.03088☆53Updated 5 years ago
- Repository containing PyTorch code for EKFAC and K-FAC preconditioners.☆142Updated last year
- Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard …☆39Updated 4 years ago
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019)☆17Updated 6 years ago
- Path-SGD: Path-Normalized Optimization in Deep Neural Networks☆19Updated 6 years ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆17Updated 5 years ago
- Code for the paper "Understanding Generalization through Visualizations"☆60Updated 4 years ago
- ☆157Updated 2 years ago
- Experiments for the paper "Exponential expressivity in deep neural networks through transient chaos"☆71Updated 8 years ago
- Paper lists and information on mean-field theory of deep learning☆74Updated 6 years ago
- ☆62Updated 3 years ago
- Collection of algorithms for approximating Fisher Information Matrix for Natural Gradient (and second order method in general)☆139Updated 5 years ago
- ☆66Updated 6 years ago
- SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep Learning☆23Updated 6 years ago
- Computing various measures and generalization bounds on convolutional and fully connected networks☆35Updated 6 years ago
- Testing Neural Tangent Kernel (NTK) on small UCI datasets☆83Updated 5 years ago
- Computing various norms/measures on over-parametrized neural networks☆49Updated 6 years ago
- Limitations of the Empirical Fisher Approximation☆47Updated 2 months ago
- ☆50Updated 2 years ago
- ☆29Updated 4 years ago
- ☆65Updated 9 months ago
- Lua implementation of Entropy-SGD☆82Updated 7 years ago
- Implementation of Invariant Risk Minimization https://arxiv.org/abs/1907.02893☆86Updated 5 years ago