juntang-zhuang/Adabelief-Optimizer

Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"

☆1,071

Alternatives and similar repositories for Adabelief-Optimizer

Users who are interested in Adabelief-Optimizer are comparing it to the libraries listed below.

jettify / pytorch-optimizer
torch-optimizer -- collection of optimizers for Pytorch
☆3,170 · Mar 22, 2024 · Updated 2 years ago
lucidrains / lambda-networks
Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute
☆1,528 · Nov 18, 2020 · Updated 5 years ago
lessw2020 / Ranger-Deep-Learning-Optimizer
Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase
☆1,204 · Dec 22, 2023 · Updated 2 years ago
LiyuanLucasLiu / RAdam
On the Variance of the Adaptive Learning Rate and Beyond
☆2,547 · Jul 31, 2021 · Updated 4 years ago
Luolc / AdaBound
An optimizer that trains as fast as Adam and as good as SGD.
☆2,904 · Jul 23, 2023 · Updated 2 years ago
clovaai / AdamP
AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights (ICLR 2021)
☆416 · Jan 13, 2021 · Updated 5 years ago
lessw2020 / Best-Deep-Learning-Optimizers
Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable
☆218 · Apr 4, 2021 · Updated 5 years ago
facebookresearch / madgrad
MADGRAD Optimization Method
☆801 · Jan 27, 2025 · Updated last year
idiap / fast-transformers
Pytorch library for fast transformer implementations
☆1,773 · Mar 23, 2023 · Updated 3 years ago
arogozhnikov / einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
☆9,553 · Jul 5, 2026 · Updated 2 weeks ago
lessw2020 / Ranger21
Ranger deep learning optimizer rewrite to use newest components
☆341 · Mar 17, 2026 · Updated 4 months ago
mgrankin / over9000
Over9000 optimizer
☆424 · Nov 22, 2022 · Updated 3 years ago
Yonghongwei / Gradient-Centralization
A New Optimization Technique for Deep Neural Networks
☆539 · Jan 13, 2022 · Updated 4 years ago
szq0214 / MEAL-V2
MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks. In NeurIPS 2020 workshop.
☆700 · Dec 24, 2021 · Updated 4 years ago
lucidrains / performer-pytorch
An implementation of Performer, a linear attention-based transformer, in Pytorch
☆1,179 · Feb 2, 2022 · Updated 4 years ago
digantamisra98 / Mish
Official Repository for "Mish: A Self Regularized Non-Monotonic Neural Activation Function" [BMVC 2020]
☆1,300 · Updated this week
facebookresearch / AugLy
A data augmentations library for audio, image, text, and video.
☆5,087 · Updated this week
facebookresearch / xcit
Official code Cross-Covariance Image Transformer (XCiT)
☆681 · Sep 28, 2021 · Updated 4 years ago
Lightning-AI / pytorch-lightning
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
☆31,241 · Updated this week
microsoft / fastformers
FastFormers - highly efficient transformer models for NLU
☆706 · Mar 21, 2025 · Updated last year
facebookresearch / deit
Official DeiT repository
☆4,358 · Mar 15, 2024 · Updated 2 years ago
mit-han-lab / data-efficient-gans
[NeurIPS 2020] Differentiable Augmentation for Data-Efficient GAN Training
☆1,309 · Sep 24, 2024 · Updated last year
VITA-Group / TransGAN
[NeurIPS‘2021] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wang
☆1,693 · Nov 3, 2022 · Updated 3 years ago
davda54 / sam
SAM: Sharpness-Aware Minimization (PyTorch)
☆1,983 · Feb 21, 2024 · Updated 2 years ago
huggingface / pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆36,993 · Updated this week
facebookresearch / fairscale
PyTorch extensions for high performance and large scale training.
☆3,411 · Apr 26, 2025 · Updated last year
adobe / antialiased-cnns
pip install antialiased-cnns to improve stability and accuracy
☆1,685 · Apr 8, 2024 · Updated 2 years ago
meta-pytorch / captum
Model interpretability and understanding for PyTorch
☆5,672 · Updated this week
sail-sg / Adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
☆819 · Jun 8, 2025 · Updated last year
facebookresearch / higher
higher is a pytorch library allowing users to obtain higher order gradients over losses spanning training loops rather than individual tr…
☆1,629 · Mar 25, 2022 · Updated 4 years ago
kornia / kornia
🐍 Geometric Computer Vision Library for Spatial AI
☆11,282 · Updated this week
huggingface / pytorch_block_sparse
Fast Block Sparse Matrices for Pytorch
☆551 · Jan 21, 2021 · Updated 5 years ago
NVlabs / imaginaire
NVIDIA's Deep Imagination Team's PyTorch Library
☆4,078 · Nov 29, 2022 · Updated 3 years ago
sacmehta / delight
DeLighT: Very Deep and Light-Weight Transformers
☆469 · Oct 16, 2020 · Updated 5 years ago
XuezheMax / apollo
Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization
☆181 · Nov 21, 2021 · Updated 4 years ago
NVlabs / NVAE
The Official PyTorch Implementation of "NVAE: A Deep Hierarchical Variational Autoencoder" (NeurIPS 2020 spotlight paper)
☆1,093 · Dec 6, 2022 · Updated 3 years ago
utsaslab / MONeT
MONeT framework for reducing memory consumption of DNN training
☆174 · May 4, 2021 · Updated 5 years ago
mlpen / Nystromformer
☆391 · Oct 18, 2023 · Updated 2 years ago
ml-jku / hopfield-layers
Hopfield Networks is All You Need
☆1,957 · Apr 23, 2023 · Updated 3 years ago