leiwu0 / sgd.stability
Analyze the dynamic stability of SGD
☆12Updated 6 years ago
Alternatives and similar repositories for sgd.stability:
Users interested in sgd.stability are comparing it to the repositories listed below
- Convolutional Neural Tangent Kernel☆109Updated 5 years ago
- NTK reading group☆88Updated 5 years ago
- Repository containing PyTorch code for EKFAC and K-FAC preconditioners.☆141Updated last year
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019)☆17Updated 5 years ago
- Testing Neural Tangent Kernel (NTK) on small UCI datasets☆83Updated 5 years ago
- Paper lists and information on mean-field theory of deep learning☆75Updated 5 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆104Updated 4 years ago
- This repository is no longer maintained. Check☆81Updated 4 years ago
- Code for Self-Tuning Networks (ICLR 2019) https://arxiv.org/abs/1903.03088☆53Updated 5 years ago
- ☆36Updated 3 years ago
- ☆156Updated 2 years ago
- ☆67Updated 5 years ago
- PyTorch implementation of KFAC and E-KFAC (Natural Gradient).☆130Updated 5 years ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆17Updated 5 years ago
- ☆82Updated 5 years ago
- Limitations of the Empirical Fisher Approximation☆47Updated 4 years ago
- ☆15Updated 4 years ago
- Path-SGD: Path-Normalized Optimization in Deep Neural Networks☆19Updated 6 years ago
- Experiments for the paper "Exponential expressivity in deep neural networks through transient chaos"☆70Updated 8 years ago
- Code for experiments in my blog post on the Neural Tangent Kernel: https://eigentales.com/NTK☆172Updated 5 years ago
- Hessian spectral density estimation in TF and Jax☆121Updated 4 years ago
- Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard …☆39Updated 4 years ago
- ☆58Updated last year
- Implementation of the Functional Neural Process models☆43Updated 4 years ago
- ☆150Updated 4 years ago
- Implementation of Invariant Risk Minimization https://arxiv.org/abs/1907.02893☆85Updated 5 years ago
- Collection of algorithms for approximating the Fisher Information Matrix for Natural Gradient (and second-order methods in general)☆136Updated 5 years ago
- ☆29Updated 4 years ago
- Code for ICML 2018 paper on "Fast and Scalable Bayesian Deep Learning by Weight-Perturbation in Adam" by Khan, Nielsen, Tangkaratt, Lin, …☆111Updated 6 years ago
- Discrete Normalizing Flows implemented in PyTorch☆109Updated 3 years ago