davda54 / ada-hessian
Easy-to-use AdaHessian optimizer (PyTorch)
☆77Updated 3 years ago
Related projects: ⓘ
- ☆96Updated 2 years ago
- Pytorch implementation of preconditioned stochastic gradient descent (affine group preconditioner, low-rank approximation preconditioner …☆105Updated this week
- 🧀 Pytorch code for the Fromage optimiser.☆120Updated 2 months ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).☆103Updated 2 years ago
- 👩 Pytorch and Jax code for the Madam optimiser.☆50Updated 3 years ago
- Structured matrices for compressing neural networks☆65Updated 11 months ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch☆141Updated 11 months ago
- Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities☆207Updated 4 months ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆97Updated 3 years ago
- ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning☆263Updated last year
- Hessian spectral density estimation in TF and Jax☆112Updated 4 years ago
- Ἀνατομή is a PyTorch library to analyze representation of neural networks☆61Updated 10 months ago
- DeepOBS: A Deep Learning Optimizer Benchmark Suite☆103Updated 8 months ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"☆176Updated 3 years ago
- ☆35Updated 2 years ago
- Fast Discounted Cumulative Sums in PyTorch☆95Updated 3 years ago
- Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable☆212Updated 3 years ago
- Very deep VAEs in JAX/Flax☆45Updated 3 years ago
- Create animations for the optimization trajectory of neural nets☆133Updated 7 months ago
- Differentiable Algorithms and Algorithmic Supervision.☆101Updated last year
- ☆34Updated 7 months ago
- ☆33Updated 4 years ago
- Bayesianize: A Bayesian neural network wrapper in pytorch☆81Updated 4 months ago
- Codebase for Learning Invariances in Neural Networks☆94Updated last year
- ☆14Updated 4 years ago
- Pytorch implementation of Variational Dropout Sparsifies Deep Neural Networks☆83Updated 2 years ago
- ASDL: Automatic Second-order Differentiation Library for PyTorch☆176Updated last month
- a lightweight transformer library for PyTorch☆71Updated 2 years ago
- Pytorch implementation of the Power Spherical distribution☆73Updated last month
- ☆46Updated 3 years ago