optimizer & lr scheduler & loss function collections in PyTorch
☆394 · Mar 1, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for pytorch_optimizer
Users who are interested in pytorch_optimizer are comparing it to the libraries listed below.
- torch-optimizer -- collection of optimizers for Pytorch ☆3,167 · Mar 22, 2024 · Updated 2 years ago
- Prodigy and Schedule-Free, together at last. ☆89 · Sep 27, 2025 · Updated 6 months ago
- A collection of niche / personally useful PyTorch optimizers with modified code. ☆27 · Oct 25, 2025 · Updated 5 months ago
- The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training” ☆988 · Jan 30, 2024 · Updated 2 years ago
- 🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch ☆2,183 · Nov 27, 2024 · Updated last year
- stochastic bfloat16 based optimizer library ☆21 · Dec 4, 2024 · Updated last year
- D-Adaptation for SGD, Adam and AdaGrad ☆530 · Jan 22, 2025 · Updated last year
- Ranger deep learning optimizer rewrite to use newest components ☆341 · Mar 17, 2026 · Updated last week
- ☆21 · Jan 23, 2024 · Updated 2 years ago
- ☆256 · Dec 2, 2024 · Updated last year
- [ACL 2023] The official implementation of "CAME: Confidence-guided Adaptive Memory Optimization" ☆97 · Mar 22, 2025 · Updated last year
- Testing various improvements to Ranger21 for 2022 ☆19 · Nov 6, 2024 · Updated last year
- Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models ☆811 · Jun 8, 2025 · Updated 9 months ago
- Efficient kernel for RMS normalization with fused operations, includes both forward and backward passes, compatibility with PyTorch. ☆12 · Jun 5, 2024 · Updated last year
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training ☆37 · Jun 20, 2025 · Updated 9 months ago
- Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable ☆218 · Apr 4, 2021 · Updated 4 years ago
- Fast, Modern, and Low Precision PyTorch Optimizers ☆128 · Dec 29, 2025 · Updated 2 months ago
- Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs. ☆382 · Jun 4, 2024 · Updated last year
- Collect optimizer related papers, data, repositories ☆99 · Nov 15, 2024 · Updated last year
- 7th place solution to RecSys Challenge 2023 by Corca ☆11 · Jan 8, 2024 · Updated 2 years ago
- Muon is an optimizer for hidden layers in neural networks ☆2,428 · Jan 19, 2026 · Updated 2 months ago
- ☆23 · Jan 5, 2025 · Updated last year
- PyTorch Implementation of Variance Reduced Optimization Algorithms -- SARAH and SVRG. ☆15 · Jul 11, 2021 · Updated 4 years ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch ☆135 · Oct 15, 2025 · Updated 5 months ago
- Efficient optimizers ☆297 · Updated this week
- SAM: Sharpness-Aware Minimization (PyTorch) ☆1,966 · Feb 21, 2024 · Updated 2 years ago
- Schedule-Free Optimization in PyTorch ☆2,265 · May 21, 2025 · Updated 10 months ago
- TorchOpt is an efficient library for differentiable optimization built upon PyTorch. ☆626 · Mar 2, 2026 · Updated 3 weeks ago
- ☆35 · Dec 5, 2022 · Updated 3 years ago
- Parameter-Free Optimizers for Pytorch ☆131 · Apr 23, 2024 · Updated last year
- Multidimensional indexing for tensors ☆138 · Jul 17, 2023 · Updated 2 years ago
- Amos optimizer with JEstimator lib. ☆82 · May 15, 2024 · Updated last year
- Permutation invariant training in PyTorch ☆13 · Oct 2, 2020 · Updated 5 years ago
- APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Outstanding Paper Honorable Mention ☆271 · Nov 29, 2025 · Updated 3 months ago
- The Prodigy optimizer and its variants for training neural networks. ☆453 · Jan 16, 2025 · Updated last year
- ☆13 · Apr 16, 2022 · Updated 3 years ago
- Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase ☆1,207 · Dec 22, 2023 · Updated 2 years ago
- The open source implementation of the multi grouped query attention by the paper "GQA: Training Generalized Multi-Query Transformer Model… ☆15 · Dec 11, 2023 · Updated 2 years ago
- Benchmarking Optimizers for LLM Pretraining ☆56 · Dec 30, 2025 · Updated 2 months ago