orobix / fwdgrad
Implementation of the paper "Gradients without Backpropagation" (https://arxiv.org/abs/2202.08587) using functorch
☆109 Updated last year
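For readers unfamiliar with the idea behind the paper, the following is a minimal sketch of a forward gradient, assuming a toy quadratic objective. It is not the repository's code and does not use functorch: the `Dual` class, `loss`, `forward_gradient`, and `train` are hypothetical names introduced only for illustration. A random direction v is sampled, the directional derivative of the loss along v is computed in a single forward pass with dual numbers, and that scalar times v is used as the gradient estimate.

```python
import random


class Dual:
    """A value paired with its tangent (derivative along one direction)."""

    def __init__(self, val, tan=0.0):
        self.val, self.tan = val, tan

    @staticmethod
    def _lift(other):
        # Plain numbers are constants, so their tangent is zero.
        return other if isinstance(other, Dual) else Dual(other)

    def __add__(self, other):
        o = self._lift(other)
        return Dual(self.val + o.val, self.tan + o.tan)

    __radd__ = __add__

    def __sub__(self, other):
        o = self._lift(other)
        return Dual(self.val - o.val, self.tan - o.tan)

    def __mul__(self, other):
        o = self._lift(other)
        # Product rule for the tangent part.
        return Dual(self.val * o.val, self.tan * o.val + self.val * o.tan)

    __rmul__ = __mul__


def loss(theta):
    # Hypothetical toy objective: squared distance to the point (1, -2, 3).
    target = (1.0, -2.0, 3.0)
    total = 0.0
    for t, c in zip(theta, target):
        d = t - c
        total = total + d * d
    return total


def forward_gradient(f, theta, rng):
    # Sample a direction v with independent standard normal entries.
    v = [rng.gauss(0.0, 1.0) for _ in theta]
    # One forward pass yields f(theta) and the directional derivative along v.
    out = f([Dual(t, vi) for t, vi in zip(theta, v)])
    # Gradient estimate: (directional derivative) * v.
    return out.val, [out.tan * vi for vi in v]


def train(steps=2000, lr=0.01, seed=0):
    rng = random.Random(seed)
    theta = [0.0, 0.0, 0.0]
    for _ in range(steps):
        _, g = forward_gradient(loss, theta, rng)
        theta = [t - lr * gi for t, gi in zip(theta, g)]
    return theta
```

Calling `train()` runs plain gradient descent with the forward-gradient estimate in place of a backpropagated gradient. The step size and step count are arbitrary choices for this toy problem, not values taken from the paper or the repository.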
Alternatives and similar repositories for fwdgrad
Users who are interested in fwdgrad compare it to the libraries listed below.
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition… ☆175 Updated last week
- ☆206 Updated 2 years ago
- Create animations for the optimization trajectory of neural nets ☆155 Updated last year
- Distributed K-FAC preconditioner for PyTorch ☆87 Updated this week
- ☆68 Updated 6 months ago
- ☆63 Updated 3 years ago
- {KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch ☆215 Updated last week
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient). ☆132 Updated 5 years ago
- Study on the applicability of Direct Feedback Alignment to neural view synthesis, recommender systems, geometric learning, and natural la… ☆88 Updated 2 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020. ☆75 Updated 10 months ago
- ☆53 Updated 8 months ago
- ☆228 Updated 3 months ago
- TensorLy-Torch: Deep Tensor Learning with TensorLy and PyTorch ☆78 Updated 11 months ago
- Structured matrices for compressing neural networks ☆66 Updated last year
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform ☆40 Updated 6 years ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine… ☆37 Updated 2 years ago
- Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair ☆47 Updated last year
- Collect optimizer related papers, data, repositories ☆91 Updated 6 months ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs ☆36 Updated 2 years ago
- Lightweight torch implementation of rigl, a sparse-to-sparse optimizer. ☆56 Updated 3 years ago
- Parallelizing non-linear sequential models over the sequence length ☆51 Updated 4 months ago
- ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning ☆276 Updated 2 years ago
- Easy Hypernetworks in Pytorch and Jax ☆100 Updated 2 years ago
- ☆67 Updated 6 years ago
- ☆185 Updated 6 months ago
- Butterfly matrix multiplication in PyTorch ☆169 Updated last year
- Easy-to-use AdaHessian optimizer (PyTorch) ☆79 Updated 4 years ago
- ☆74 Updated 2 years ago
- Approximating neural network loss landscapes in low-dimensional parameter subspaces for PyTorch ☆334 Updated last year
- ☆36 Updated last year