YiwenShaoStephen / NGD-SGD
A Pytorch Implementation of Natural Gradient Descent
☆46 · Updated 6 years ago
Alternatives and similar repositories for NGD-SGD
Users who are interested in NGD-SGD are comparing it to the libraries listed below.
- Implementation of "Gradients without backpropagation" paper (https://arxiv.org/abs/2202.08587) using functorch ☆113 · Updated 2 years ago
- Repository containing Pytorch code for EKFAC and K-FAC preconditioners. ☆146 · Updated 2 years ago
- ☆164 · Updated 2 years ago
- Distributed K-FAC preconditioner for PyTorch ☆90 · Updated last week
- A PyTorch Implementation of the Sparsemax operator (https://arxiv.org/pdf/1803.09820.pdf) ☆34 · Updated 2 years ago
- Adaptive Gradient Clipping ☆147 · Updated 2 years ago
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient). ☆132 · Updated 6 years ago
- ☆24 · Updated 10 months ago
- [ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496) ☆14 · Updated 10 months ago
- Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers" ☆84 · Updated 3 years ago
- Easy Hypernetworks in Pytorch and Jax ☆104 · Updated 2 years ago
- Online Normalization for Training Neural Networks (Companion Repository) ☆84 · Updated 4 years ago
- Accelerated First Order Parallel Associative Scan ☆188 · Updated last year
- Beyond Straight-Through ☆102 · Updated 2 years ago
- Sequence Modeling with Structured State Spaces ☆66 · Updated 3 years ago
- Vector Quantized Autoregressive Predictive Coding (VQ-APC) ☆37 · Updated 4 years ago
- Automatic differentiation with weighted finite-state transducers. ☆126 · Updated 3 years ago
- Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5) ☆78 · Updated last year
- ASDL: Automatic Second-order Differentiation Library for PyTorch ☆189 · Updated 9 months ago
- Implementation of Flow++ in PyTorch ☆40 · Updated 6 years ago
- A PyTorch implementation of "Continuous Relaxation Training of Discrete Latent Variable Image Models" ☆73 · Updated 5 years ago
- Jax/Flax implementation of Variational-DiffWave. ☆40 · Updated 3 years ago
- A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models. ☆68 · Updated 3 years ago
- Fast Discounted Cumulative Sums in PyTorch ☆96 · Updated 4 years ago
- Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023) ☆57 · Updated 2 weeks ago
- PyTorch implementations of normalizing flow and its variants. ☆78 · Updated 4 years ago
- Rational Activation Functions - Replacing Padé Activation Units ☆97 · Updated 6 months ago
- Pytorch implementation of KFAC - this is a port of https://github.com/tensorflow/kfac/ ☆26 · Updated last year
- GBDT-NAS ☆28 · Updated 3 years ago
- ☆66 · Updated last year