amirgholami / adahessianLinks
ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning
☆277Updated 2 years ago
Alternatives and similar repositories for adahessian
Users that are interested in adahessian are comparing it to the libraries listed below
Sorting:
- BackPACK - a backpropagation package built on top of PyTorch which efficiently computes quantities other than the gradient.☆588Updated 7 months ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch☆147Updated last year
- Efficient PyTorch Hessian eigendecomposition tools!☆376Updated last year
- ASDL: Automatic Second-order Differentiation Library for PyTorch☆189Updated 8 months ago
- {KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch☆215Updated 2 months ago
- Distributed K-FAC preconditioner for PyTorch☆89Updated this week
- End-to-end training of sparse deep neural networks with little-to-no performance loss.☆324Updated 2 years ago
- Easy-to-use AdaHessian optimizer (PyTorch)☆79Updated 4 years ago
- Create animations for the optimization trajectory of neural nets☆158Updated last year
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient).☆132Updated 6 years ago
- PyHessian is a Pytorch library for second-order based analysis and training of Neural Networks☆750Updated last month
- 🧀 Pytorch code for the Fromage optimiser.☆126Updated last year
- Repository containing Pytorch code for EKFAC and K-FAC perconditioners.☆146Updated 2 years ago
- ☆157Updated 3 years ago
- ☆233Updated 6 months ago
- Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities☆208Updated last year
- ☆192Updated 4 years ago
- Hessian spectral density estimation in TF and Jax☆123Updated 4 years ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"☆182Updated 4 years ago
- Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable☆216Updated 4 years ago
- Butterfly matrix multiplication in PyTorch☆174Updated last year
- Mode Connectivity and Fast Geometric Ensembles in PyTorch☆274Updated 2 years ago
- Code for experiments in my blog post on the Neural Tangent Kernel: https://eigentales.com/NTK☆176Updated 5 years ago
- ☆226Updated last year
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆179Updated last week
- Lightweight torch implementation of rigl, a sparse-to-sparse optimizer.☆57Updated 3 years ago
- [Prototype] Tools for the concurrent manipulation of variably sized Tensors.☆251Updated 2 years ago
- Approximating neural network loss landscapes in low-dimensional parameter subspaces for PyTorch☆339Updated last year
- ☆67Updated 6 years ago
- ☆144Updated 2 years ago