chijames / GSTLinks
☆19Updated 3 years ago
Alternatives and similar repositories for GST
Users that are interested in GST are comparing it to the libraries listed below
Sorting:
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient).☆132Updated 6 years ago
- Repository containing Pytorch code for EKFAC and K-FAC perconditioners.☆149Updated 2 years ago
- Efficient PyTorch Hessian eigendecomposition tools!☆382Updated last year
- ☆68Updated 6 years ago
- ☆11Updated 4 years ago
- NTK reading group☆87Updated 6 years ago
- ☆125Updated last year
- Hessian spectral density estimation in TF and Jax☆124Updated 5 years ago
- Lipschitz Neural Networks described in "Sorting Out Lipschitz Function Approximation" (ICML 2019).☆58Updated 5 years ago
- paper lists and information on mean-field theory of deep learning☆78Updated 6 years ago
- Visualizing the the loss landscape of Fully-Connected Neural Networks☆46Updated 2 years ago
- Distributed K-FAC preconditioner for PyTorch☆93Updated last week
- Convolutional Neural Tangent Kernel☆112Updated 6 years ago
- ☆59Updated 2 years ago
- ☆73Updated last year
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform☆43Updated 6 years ago
- ☆157Updated 3 years ago
- {KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch☆216Updated this week
- Limitations of the Empirical Fisher Approximation☆49Updated 9 months ago
- PyTorch implementation of Mixer-nano (#parameters is 0.67M, originally Mixer-S/16 has 18M) with 90.83 % acc. on CIFAR-10. Training from s…☆36Updated 4 years ago
- Collection of algorithms for approximating Fisher Information Matrix for Natural Gradient (and second order method in general)☆142Updated 6 years ago
- Mode Connectivity and Fast Geometric Ensembles in PyTorch☆281Updated 3 years ago
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019)☆16Updated 6 years ago
- ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning☆281Updated 2 years ago
- Lightweight torch implementation of rigl, a sparse-to-sparse optimizer.☆60Updated 4 years ago
- ☆193Updated this week
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆106Updated 5 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch☆148Updated 2 years ago
- ☆83Updated 5 years ago
- Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms"☆50Updated 3 years ago