orobix / fwdgrad
Implementation of "Gradients without backpropagation" paper (https://arxiv.org/abs/2202.08587) using functorch
☆108Updated last year
Alternatives and similar repositories for fwdgrad:
Users that are interested in fwdgrad are comparing it to the libraries listed below
- {KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch☆211Updated last week
- ☆202Updated 2 years ago
- ASDL: Automatic Second-order Differentiation Library for PyTorch☆185Updated 3 months ago
- Distributed K-FAC Preconditioner for PyTorch☆85Updated last week
- ☆60Updated 3 years ago
- ☆66Updated 6 years ago
- Create animations for the optimization trajectory of neural nets☆147Updated last year
- Neural Tangent Kernel Papers☆108Updated 2 months ago
- PyTorch implementation of Mixer-nano (#parameters is 0.67M, originally Mixer-S/16 has 18M) with 90.83 % acc. on CIFAR-10. Training from s…☆31Updated 3 years ago
- Easy Hypernetworks in Pytorch and Jax☆99Updated 2 years ago
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform☆38Updated 5 years ago
- ☆221Updated last month
- Lightweight torch implementation of rigl, a sparse-to-sparse optimizer.☆56Updated 3 years ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆53Updated 3 weeks ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆122Updated last year
- Accelerated First Order Parallel Associative Scan☆180Updated 7 months ago
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient).☆130Updated 5 years ago
- Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023)☆52Updated 4 months ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).☆108Updated 2 years ago
- Butterfly matrix multiplication in PyTorch☆168Updated last year
- Lightning-like training API for JAX with Flax☆38Updated 3 months ago
- Package for working with hypernetworks in PyTorch.☆122Updated last year
- Structured matrices for compressing neural networks☆66Updated last year
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆74Updated 8 months ago
- Collect optimizer related papers, data, repositories☆89Updated 4 months ago
- Hessian backpropagation (HBP): PyTorch extension of backpropagation for block-diagonal curvature matrix approximations☆20Updated 2 years ago
- ☆65Updated 3 months ago
- Mode Connectivity and Fast Geometric Ensembles in PyTorch☆269Updated 2 years ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆171Updated this week
- ☆54Updated 8 months ago