orobix / fwdgrad
Implementation of the "Gradients without Backpropagation" paper (https://arxiv.org/abs/2202.08587) using functorch
☆108 Updated last year
Alternatives and similar repositories for fwdgrad:
Users who are interested in fwdgrad are comparing it to the libraries listed below
- ☆67 Updated 4 months ago
- Distributed K-FAC Preconditioner for PyTorch ☆85 Updated last week
- {KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch ☆212 Updated this week
- PyTorch implementation of KFAC and E-KFAC (Natural Gradient). ☆132 Updated 5 years ago
- PyTorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition… ☆173 Updated this week
- Lightweight torch implementation of RigL, a sparse-to-sparse optimizer. ☆56 Updated 3 years ago
- ☆203 Updated 2 years ago
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform ☆38 Updated 6 years ago
- ☆52 Updated 6 months ago
- ☆62 Updated 3 years ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine… ☆36 Updated 2 years ago
- Neural Tangent Kernel Papers ☆108 Updated 3 months ago
- Structured matrices for compressing neural networks ☆66 Updated last year
- Butterfly matrix multiplication in PyTorch ☆169 Updated last year
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020. ☆75 Updated 9 months ago
- Create animations for the optimization trajectory of neural nets ☆153 Updated last year
- Modern Fixed Point Systems using PyTorch ☆89 Updated last year
- Accelerated First Order Parallel Associative Scan ☆181 Updated 8 months ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture" ☆106 Updated 4 years ago
- A custom PyTorch layer that is capable of implementing extremely wide and sparse linear layers efficiently ☆49 Updated last year
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023) ☆123 Updated last year
- Unofficial implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023) ☆53 Updated 5 months ago
- PyTorch implementation of Mixer-nano (#parameters is 0.67M, originally Mixer-S/16 has 18M) with 90.83 % acc. on CIFAR-10. Training from s… ☆32 Updated 3 years ago
- ☆66 Updated 6 years ago
- Fast training of unitary deep network layers from low-rank updates ☆28 Updated 2 years ago
- Mode Connectivity and Fast Geometric Ensembles in PyTorch ☆270 Updated 2 years ago
- Hessian spectral density estimation in TF and JAX ☆123 Updated 4 years ago
- ASDL: Automatic Second-order Differentiation Library for PyTorch ☆185 Updated 4 months ago
- Study on the applicability of Direct Feedback Alignment to neural view synthesis, recommender systems, geometric learning, and natural la… ☆87 Updated 2 years ago
- ☆224 Updated 2 months ago