timgaripov / dnn-mode-connectivityLinks
Mode Connectivity and Fast Geometric Ensembles in PyTorch
☆282Updated 3 years ago
Alternatives and similar repositories for dnn-mode-connectivity
Users that are interested in dnn-mode-connectivity are comparing it to the libraries listed below
Sorting:
- Efficient PyTorch Hessian eigendecomposition tools!☆382Updated last year
- ☆158Updated 3 years ago
- Project site for "Your Classifier is Secretly an Energy-Based Model and You Should Treat it Like One"☆426Updated 3 years ago
- NTK reading group☆87Updated 6 years ago
- ☆59Updated 2 years ago
- A pytorch implementation of our jacobian regularizer to encourage learning representations more robust to input perturbations.☆129Updated 2 years ago
- Code for experiments in my blog post on the Neural Tangent Kernel: https://eigentales.com/NTK☆173Updated 6 years ago
- ☆126Updated last year
- Lipschitz Neural Networks described in "Sorting Out Lipschitz Function Approximation" (ICML 2019).☆58Updated 5 years ago
- {KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch☆216Updated 2 weeks ago
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient).☆132Updated 6 years ago
- Convolutional Neural Tangent Kernel☆112Updated 6 years ago
- Repository containing Pytorch code for EKFAC and K-FAC perconditioners.☆149Updated 2 years ago
- ☆194Updated 2 weeks ago
- Release of CIFAR-10.1, a new test set for CIFAR-10.☆225Updated 5 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch☆148Updated 2 years ago
- Code for "Picking Winning Tickets Before Training by Preserving Gradient Flow" https://openreview.net/pdf?id=SkgsACVKPH☆105Updated 5 years ago
- Understanding Training Dynamics of Deep ReLU Networks☆305Updated 2 months ago
- Rethinking Bias-Variance Trade-off for Generalization of Neural Networks☆50Updated 4 years ago
- Train ImageNet *fast* in 500 lines of code with FFCV☆149Updated last year
- Learning Sparse Neural Networks through L0 regularization☆245Updated 5 years ago
- Code for the paper "Understanding Generalization through Visualizations"☆65Updated 4 years ago
- Hypergradient descent☆147Updated last year
- Implementation of Invariant Risk Minimization https://arxiv.org/abs/1907.02893☆91Updated 5 years ago
- ☆144Updated 2 years ago
- ☆68Updated 6 years ago
- Gradient Starvation: A Learning Proclivity in Neural Networks☆61Updated 4 years ago
- ☆133Updated 4 years ago
- ☆83Updated 5 years ago
- ☆56Updated 5 years ago