iShohei220 / adoptLinks

Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"

☆426

Alternatives and similar repositories for adopt

Users that are interested in adopt are comparing it to the libraries listed below

Sorting:

nanowell / AdEMAMix-Optimizer-Pytorch
The AdEMAMix Optimizer: Better, Faster, Older.
☆183Updated 10 months ago
facebookresearch / optimizers
For optimization algorithm research and development.
☆521Updated this week
TorchJD / torchjd
Library for Jacobian descent with PyTorch. It enables the optimization of neural networks with multiple losses (e.g. multi-task learning)…
☆251Updated this week
HomebrewML / HeavyBall
Efficient optimizers
☆232Updated last week
KellerJordan / cifar10-airbench
CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds
☆263Updated 4 months ago
srush / annotated-mamba
Annotated version of the Mamba paper
☆486Updated last year
KindXiaoming / grow-crystals
Getting crystal-like representations with harmonic loss
☆191Updated 3 months ago
epistoteles / TensorHue
TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conte…
☆118Updated 4 months ago
nikhilvyas / SOAP
☆197Updated 7 months ago
facebookresearch / schedule_free
Schedule-Free Optimization in PyTorch
☆2,189Updated last month
kvfrans / jax-diffusion-transformer
Implementation of Diffusion Transformer (DiT) in JAX
☆279Updated last year
facebookresearch / spdl
Scalable and Performant Data Loading
☆288Updated last week
pytorch / tensordict
TensorDict is a pytorch dedicated tensor container.
☆937Updated last week
kyleliang919 / C-Optim
When it comes to optimizers, it's always better to be safe than sorry
☆246Updated 3 months ago
lucidrains / pytorch-custom-utils
Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…
☆123Updated 11 months ago
apple / ml-sigma-reparam
☆303Updated last year
facebookresearch / dadaptation
D-Adaptation for SGD, Adam and AdaGrad
☆523Updated 5 months ago
fferflo / einx
Universal Tensor Operations in Einstein-Inspired Notation for Python.
☆385Updated 3 months ago
PeaBrane / mamba-tiny
Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).
☆120Updated 8 months ago
BlackHC / neural_net_checklist
☆150Updated 11 months ago
yuanchenyang / smalldiffusion
Simple and readable code for training and sampling from diffusion models
☆511Updated last month
evanatyourservice / kron_torch
An implementation of PSGD Kron second-order optimizer for PyTorch
☆92Updated 3 months ago
lucidrains / minGRU-pytorch
Implementation of the proposed minGRU in Pytorch
☆301Updated 4 months ago
ironjr / grokfast
Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"
☆555Updated last year
helblazer811 / Diffusion-Explorer
Interactive visualizations of the geometric intuition behind diffusion models.
☆783Updated 3 weeks ago
kach / gradient-descent-the-ultimate-optimizer
Code for our NeurIPS 2022 paper
☆369Updated 2 years ago
SynodicMonth / ChebyKAN
Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.
☆380Updated last year
lucidrains / nGPT-pytorch
Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI
☆287Updated last month
zyushun / Adam-mini
Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793
☆427Updated 2 months ago
johnmarktaylor91 / torchlens
Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code.
☆595Updated 4 months ago