gpauloski / kfac-pytorch
Distributed K-FAC Preconditioner for PyTorch
☆85 Updated this week
Alternatives and similar repositories for kfac-pytorch:
Users who are interested in kfac-pytorch are comparing it to the libraries listed below:
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient). ☆131 Updated 5 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020. ☆74 Updated 8 months ago
- Sparsity support for PyTorch ☆34 Updated 3 weeks ago
- Butterfly matrix multiplication in PyTorch ☆169 Updated last year
- ☆36 Updated 4 months ago
- Hessian backpropagation (HBP): PyTorch extension of backpropagation for block-diagonal curvature matrix approximations ☆20 Updated 2 years ago
- Limitations of the Empirical Fisher Approximation ☆47 Updated last month
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform ☆38 Updated 5 years ago
- ASDL: Automatic Second-order Differentiation Library for PyTorch ☆185 Updated 4 months ago
- PyTorch implementation of Hessian Free optimisation ☆43 Updated 5 years ago
- ☆47 Updated 5 years ago
- Repository containing Pytorch code for EKFAC and K-FAC preconditioners. ☆142 Updated last year
- A library for unit scaling in PyTorch ☆125 Updated 4 months ago
- TensorLy-Torch: Deep Tensor Learning with TensorLy and PyTorch ☆78 Updated 10 months ago
- A Chainer extension for K-FAC ☆20 Updated 5 years ago
- ☆29 Updated 4 years ago
- Structured matrices for compressing neural networks ☆66 Updated last year
- PyTorch-SSO: Scalable Second-Order methods in PyTorch ☆145 Updated last year
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8. ☆45 Updated 8 months ago
- ☆224 Updated 2 months ago
- Hessian spectral density estimation in TF and Jax ☆123 Updated 4 years ago
- ☆11 Updated 2 years ago
- ☆66 Updated 4 months ago
- Pytorch implementation of KFAC - this is a port of https://github.com/tensorflow/kfac/ ☆23 Updated 10 months ago
- Simple CIFAR10 ResNet example with JAX. ☆23 Updated 3 years ago
- ☆66 Updated 6 years ago
- Easy-to-use AdaHessian optimizer (PyTorch) ☆78 Updated 4 years ago
- Second Order Optimization and Curvature Estimation with K-FAC in JAX. ☆267 Updated last week
- Lightweight torch implementation of rigl, a sparse-to-sparse optimizer. ☆56 Updated 3 years ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective ☆53 Updated last month