leiwu0 / sgd.stability
Analyze the dynamic stability of SGD
☆12Updated 6 years ago
Alternatives and similar repositories for sgd.stability:
Users that are interested in sgd.stability are comparing it to the libraries listed below
- Convolutional Neural Tangent Kernel☆109Updated 5 years ago
- NTK reading group☆88Updated 5 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆104Updated 4 years ago
- Repository containing Pytorch code for EKFAC and K-FAC perconditioners.☆141Updated last year
- ☆67Updated 5 years ago
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient).☆130Updated 5 years ago
- paper lists and information on mean-field theory of deep learning☆75Updated 5 years ago
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019)☆17Updated 5 years ago
- Collection of algorithms for approximating Fisher Information Matrix for Natural Gradient (and second order method in general)☆136Updated 5 years ago
- ☆82Updated 5 years ago
- ☆58Updated last year
- The codebase for the paper "A Tail-Index Analysis of Stochastic Gradient Noise in Deep Neural Networks"☆23Updated 5 years ago
- ☆15Updated 4 years ago
- ☆29Updated 4 years ago
- Testing Nerual Tangent Kernel (NTK) on small UCI datasets☆83Updated 5 years ago
- Limitations of the Empirical Fisher Approximation☆47Updated 4 years ago
- ☆156Updated 2 years ago
- ☆36Updated 3 years ago
- ☆121Updated 8 months ago
- Path-SGD: Path-Normalized Optimization in Deep Neural Networks☆19Updated 6 years ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆17Updated 5 years ago
- Experiments for the paper "Exponential expressivity in deep neural networks through transient chaos"☆70Updated 8 years ago
- Implementation of Invariant Risk Minimization https://arxiv.org/abs/1907.02893☆85Updated 5 years ago
- Hessian spectral density estimation in TF and Jax☆121Updated 4 years ago
- Reproduction and analysis of SNIP paper☆30Updated 5 years ago
- Code for the paper "Understanding Generalization through Visualizations"☆60Updated 4 years ago
- Hypergradient descent☆145Updated 8 months ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆35Updated 2 years ago
- This repository is no longer maintained. Check☆81Updated 4 years ago
- Monotone operator equilibrium networks☆51Updated 4 years ago