facebookresearch / dadaptationLinks

D-Adaptation for SGD, Adam and AdaGrad

☆524

Alternatives and similar repositories for dadaptation

Users that are interested in dadaptation are comparing it to the libraries listed below

Sorting:

kozistr / pytorch_optimizer
optimizer & lr scheduler & loss function collections in PyTorch
☆327Updated last week
changjonathanc / minLoRA
minLoRA: a minimal PyTorch library that allows you to apply LoRA to any PyTorch model.
☆470Updated 2 years ago
kyegomez / Sophia
Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.
☆378Updated last year
kach / gradient-descent-the-ultimate-optimizer
Code for our NeurIPS 2022 paper
☆369Updated 2 years ago
lucidrains / memory-efficient-attention-pytorch
Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"
☆379Updated 2 years ago
samuela / git-re-basin
Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries"
☆486Updated 2 years ago
google / learned_optimization
☆783Updated 2 months ago
lucidrains / ema-pytorch
A simple way to keep track of an Exponential Moving Average (EMA) version of your Pytorch model
☆601Updated 8 months ago
konstmish / prodigy
The Prodigy optimizer and its variants for training neural networks.
☆408Updated 6 months ago
lucidrains / Adan-pytorch
Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch
☆252Updated 2 years ago
mlcommons / algorithmic-efficiency
MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…
☆389Updated this week
lucidrains / lion-pytorch
🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch
☆2,153Updated 8 months ago
iShohei220 / adopt
Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"
☆429Updated 7 months ago
AminRezaei0x443 / memory-efficient-attention
Memory Efficient Attention (O(sqrt(n)) for Jax and PyTorch
☆184Updated 2 years ago
Liuhong99 / Sophia
The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”
☆965Updated last year
pytorch / tensordict
TensorDict is a pytorch dedicated tensor container.
☆949Updated last week
TorchJD / torchjd
Library for Jacobian descent with PyTorch. It enables the optimization of neural networks with multiple losses (e.g. multi-task learning)…
☆264Updated this week
lucidrains / rotary-embedding-torch
Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch
☆727Updated last week
facebookresearch / optimizers
For optimization algorithm research and development.
☆524Updated this week
apple / ml-sigma-reparam
☆307Updated last year
archinetai / surgeon-pytorch
A library to inspect and extract intermediate layers of PyTorch models.
☆473Updated 3 years ago
HomebrewML / HeavyBall
Efficient optimizers
☆253Updated this week
xl0 / lovely-tensors
Tensors, for human consumption
☆1,271Updated last month
facebookresearch / torchdim
Named tensors with first-class dimensions for PyTorch
☆332Updated 2 years ago
BlackHC / toma
Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory
☆437Updated 11 months ago
fferflo / einx
Universal Tensor Operations in Einstein-Inspired Notation for Python.
☆392Updated 4 months ago
facebookresearch / dropout
Code release for "Dropout Reduces Underfitting"
☆313Updated 2 years ago
epistoteles / TensorHue
TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conte…
☆119Updated 5 months ago
OATML / RHO-Loss
☆208Updated 2 years ago
matthias-wright / flaxmodels
Pretrained deep learning models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet, etc.
☆255Updated 4 months ago