leiwu0 / sgd.stabilityLinks
Analyze the dynamic stability of SGD
☆12Updated 6 years ago
Alternatives and similar repositories for sgd.stability
Users that are interested in sgd.stability are comparing it to the libraries listed below
Sorting:
- NTK reading group☆87Updated 5 years ago
- Convolutional Neural Tangent Kernel☆111Updated 5 years ago
- ☆157Updated 3 years ago
- Repository containing Pytorch code for EKFAC and K-FAC perconditioners.☆143Updated 2 years ago
- Efficient PyTorch Hessian eigendecomposition tools!☆374Updated last year
- ☆29Updated 4 years ago
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019)☆17Updated 6 years ago
- ☆67Updated 6 years ago
- {KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch☆215Updated 3 weeks ago
- hessian in pytorch☆187Updated 4 years ago
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient).☆132Updated 6 years ago
- ☆153Updated 5 years ago
- ☆123Updated last year
- paper lists and information on mean-field theory of deep learning☆78Updated 6 years ago
- Mode Connectivity and Fast Geometric Ensembles in PyTorch☆272Updated 2 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆105Updated 4 years ago
- ☆59Updated 2 years ago
- Testing Nerual Tangent Kernel (NTK) on small UCI datasets☆81Updated 5 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch☆147Updated last year
- Collection of algorithms for approximating Fisher Information Matrix for Natural Gradient (and second order method in general)☆140Updated 6 years ago
- ☆144Updated 2 years ago
- Hypergradient descent☆149Updated last year
- Hessian spectral density estimation in TF and Jax☆123Updated 4 years ago
- ☆83Updated 5 years ago
- Lipschitz Neural Networks described in "Sorting Out Lipschitz Function Approximation" (ICML 2019).☆56Updated 5 years ago
- Path-SGD: Path-Normalized Optimization in Deep Neural Networks☆19Updated 6 years ago
- Implementation of Invariant Risk Minimization https://arxiv.org/abs/1907.02893☆89Updated 5 years ago
- Code for experiments in my blog post on the Neural Tangent Kernel: https://eigentales.com/NTK☆176Updated 5 years ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆17Updated 6 years ago
- The codebase for the paper "A Tail-Index Analysis of Stochastic Gradient Noise in Deep Neural Networks"☆25Updated 5 years ago