chijames / GSTLinks
☆19Updated 3 years ago
Alternatives and similar repositories for GST
Users that are interested in GST are comparing it to the libraries listed below
Sorting:
- Repository containing Pytorch code for EKFAC and K-FAC perconditioners.☆145Updated 2 years ago
- ☆124Updated last year
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient).☆132Updated 6 years ago
- ☆11Updated 4 years ago
- ☆157Updated 3 years ago
- ☆59Updated 2 years ago
- Efficient PyTorch Hessian eigendecomposition tools!☆375Updated last year
- Lipschitz Neural Networks described in "Sorting Out Lipschitz Function Approximation" (ICML 2019).☆56Updated 5 years ago
- NTK reading group☆87Updated 5 years ago
- Mode Connectivity and Fast Geometric Ensembles in PyTorch☆272Updated 2 years ago
- ☆67Updated 6 years ago
- Distributed K-FAC preconditioner for PyTorch☆89Updated this week
- Hessian spectral density estimation in TF and Jax☆123Updated 4 years ago
- Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms"☆49Updated 3 years ago
- {KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch☆215Updated last month
- PyTorch-SSO: Scalable Second-Order methods in PyTorch☆147Updated last year
- Convolutional Neural Tangent Kernel☆113Updated 5 years ago
- Lightweight torch implementation of rigl, a sparse-to-sparse optimizer.☆57Updated 3 years ago
- Implementation of "Gradients without backpropagation" paper (https://arxiv.org/abs/2202.08587) using functorch☆110Updated 2 years ago
- Collection of algorithms for approximating Fisher Information Matrix for Natural Gradient (and second order method in general)☆140Updated 6 years ago
- PyTorch implementation of Mixer-nano (#parameters is 0.67M, originally Mixer-S/16 has 18M) with 90.83 % acc. on CIFAR-10. Training from s…☆34Updated 3 years ago
- Code for "A Spectral Approach to Gradient Estimation for Implicit Distributions" (ICML'18)☆33Updated 2 years ago
- Limitations of the Empirical Fisher Approximation☆47Updated 5 months ago
- ☆91Updated 3 years ago
- paper lists and information on mean-field theory of deep learning☆78Updated 6 years ago
- ☆192Updated 4 years ago
- ☆70Updated 8 months ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆105Updated 4 years ago
- Code release for Hoogeboom, Emiel, Jorn WT Peters, Rianne van den Berg, and Max Welling. "Integer Discrete Flows and Lossless Compression…☆98Updated 5 years ago
- Visualizing the the loss landscape of Fully-Connected Neural Networks☆46Updated 2 years ago