zoq / Awesome-Optimizer
Collects optimizer-related papers, data, and repositories
☆99 Updated last year
Alternatives and similar repositories for Awesome-Optimizer
Users interested in Awesome-Optimizer are comparing it to the repositories listed below
- Implementation of "Gradients without backpropagation" paper (https://arxiv.org/abs/2202.08587) using functorch ☆114 Updated 2 years ago
- TensorLy-Torch: Deep Tensor Learning with TensorLy and PyTorch ☆82 Updated last year
- Distributed K-FAC preconditioner for PyTorch ☆94 Updated this week
- Neural Tangent Kernel Papers ☆121 Updated last year
- ☆73 Updated last year
- DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule ☆64 Updated 2 years ago
- ☆238 Updated last year
- summer school materials ☆46 Updated 2 years ago
- Create animations for the optimization trajectory of neural nets ☆163 Updated 2 years ago
- Omnigrok: Grokking Beyond Algorithmic Data ☆62 Updated 2 years ago
- optimizer & lr scheduler & loss function collections in PyTorch ☆388 Updated last week
- A general-purpose, deep learning-first library for constrained optimization in PyTorch ☆152 Updated 2 months ago
- Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS 2024 ☆128 Updated last year
- ☆62 Updated last year
- Pytorch code for experiments on Linear Transformers ☆25 Updated 2 years ago
- Pytorch implementation of KFAC - this is a port of https://github.com/tensorflow/kfac/ ☆30 Updated last year
- Agustinus' very opinionated publication-ready plotting library ☆70 Updated 8 months ago
- Deep Learning & Information Bottleneck ☆63 Updated 2 years ago
- Modern Fixed Point Systems using Pytorch ☆125 Updated 2 years ago
- A collection of AWESOME things about information geometry ☆176 Updated last year
- Parameter-Free Optimizers for Pytorch ☆130 Updated last year
- 😎 A curated list of tensor decomposition resources for model compression. ☆103 Updated 3 weeks ago
- Sampling with gradient-based Markov Chain Monte Carlo approaches ☆109 Updated last year
- A State-Space Model with Rational Transfer Function Representation. ☆83 Updated last year
- Implementations of various linear RNN layers using pytorch and triton ☆54 Updated 2 years ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective ☆63 Updated 10 months ago
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di… ☆70 Updated last year
- Code for the papers Linear Algebra with Transformers (TMLR) and What is my Math Transformer Doing? (AI for Maths Workshop, NeurIPS 2022) ☆76 Updated last year
- Unofficial implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023) ☆61 Updated 4 months ago
- Mutual information estimators and benchmark ☆56 Updated 4 months ago