kyleliang919/C-Optim

[ICLR 2026] When it comes to optimizers, it's always better to be safe than sorry

☆418

Alternatives and similar repositories for C-Optim

Users who are interested in C-Optim are comparing it to the libraries listed below.

AnonymousAlethiometer / SGD_SaI
Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"
☆55 · Jan 27, 2025 · Updated last year
Gunale0926 / Grams
Grams: Gradient Descent with Adaptive Momentum Scaling (ICLR 2025 Workshop)
☆17 · Mar 6, 2025 · Updated last year
KellerJordan / Muon
Muon is an optimizer for hidden layers in neural networks
☆2,747 · May 24, 2026 · Updated 2 months ago
hazdzz / cautious_adam
The PyTorch implementation of Cautious-Adam.
☆18 · Jun 25, 2025 · Updated last year
zyushun / Adam-mini
Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793
☆457 · May 13, 2025 · Updated last year
nikhilvyas / SOAP
☆275 · Dec 2, 2024 · Updated last year
kyleliang919 / Super_Muon
☆68 · Mar 21, 2025 · Updated last year
HomebrewML / HeavyBall
Efficient optimizers
☆336 · Updated this week
kyleliang919 / Online-Subspace-Descent
[NeurIPS 2024] Low rank memory efficient optimizer without SVD
☆33 · Jul 1, 2025 · Updated last year
facebookresearch / schedule_free
Schedule-Free Optimization in PyTorch
☆2,317 · Jun 18, 2026 · Updated last month
epfml / llm-optimizer-benchmark
Benchmarking Optimizers for LLM Pretraining
☆60 · May 3, 2026 · Updated 2 months ago
hxixixh / amo-release
Official implementation for CVPR 2025 paper "AMO Sampler: Enhancing Text Rendering with Overshooting"
☆30 · May 3, 2025 · Updated last year
rimads / avey-b
Code for the Avey-B paper (https://arxiv.org/abs/2602.15814)
☆32 · Feb 21, 2026 · Updated 5 months ago
dayal-kalra / low-memory-adam
☆14 · Mar 2, 2025 · Updated last year
bloc97 / DeMo
DeMo: Decoupled Momentum Optimization
☆202 · Dec 2, 2024 · Updated last year
Roblox / SmoothCache
Implementation of SmoothCache, a project aimed at speeding up Diffusion Transformer (DiT) based GenAI models with error-guided caching.
☆48 · Jul 17, 2025 · Updated last year
NoahAmsel / PolarExpress
☆33 · Jul 6, 2026 · Updated 3 weeks ago
lucidrains / lion-pytorch
🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch
☆2,195 · Jul 9, 2026 · Updated 2 weeks ago
lzhangbv / acpsgd
[ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning
☆10 · Apr 28, 2023 · Updated 3 years ago
sangyun884 / rfpp
The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024
☆133 · Oct 18, 2024 · Updated last year
Unakar / Spectral-Sphere-Optimizer
Spectral Sphere Optimizer
☆131 · Mar 23, 2026 · Updated 4 months ago
Dao-AILab / gram-newton-schulz
Fast Polar Decomposition for Muon
☆169 · Jul 2, 2026 · Updated 3 weeks ago
warner-benjamin / optimi
Fast, Modern, and Low Precision PyTorch Optimizers
☆129 · May 16, 2026 · Updated 2 months ago
Liuhong99 / Sophia
The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”
☆1,003 · Jan 30, 2024 · Updated 2 years ago
areu01or00 / Tensor-Slayer
Tensor-Slayer: Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in…
☆28 · May 27, 2025 · Updated last year
Tinycompany-AI / SuperTokenizer
Multi-Word Probabilistic based supertokenizer
☆15 · May 15, 2025 · Updated last year
iShohei220 / adopt
Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"
☆438 · Dec 12, 2024 · Updated last year
Farseer-Scaling-Law / Farseer
☆21 · Jun 12, 2025 · Updated last year
SonicCodes / subcloning
Implementation of https://arxiv.org/pdf/2312.09299
☆21 · Jul 3, 2024 · Updated 2 years ago
zhuhanqing / APOLLO
APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Outstanding Paper Honorable Mention
☆364 · Nov 29, 2025 · Updated 8 months ago
sail-sg / Adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
☆820 · Jun 8, 2025 · Updated last year
microsoft / mup
maximal update parametrization (µP)
☆1,743 · Jul 17, 2024 · Updated 2 years ago
lucidrains / simplicial-attention
Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Ro…
☆49 · Sep 2, 2025 · Updated 10 months ago
thu-ml / CCA
Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"
☆37 · Feb 11, 2025 · Updated last year
jiaweizzhao / GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
☆1,699 · Oct 28, 2024 · Updated last year
USTC-StarTeam / ZIP
arXiv 2024 | ZIP: entropy-law data selection for efficient LLM alignment.
☆28 · Jun 10, 2026 · Updated last month
fla-org / flash-linear-attention
🚀 Efficient implementations for emerging model architectures
☆5,463 · Updated this week
evanatyourservice / llm-jax
Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆19 · Jul 24, 2025 · Updated last year
wookiekim / HCCNet
Official PyTorch implementation of HCCNet: Efficient Semantic Matching with Hypercolumn Correlation (WACV '24 Oral, Best paper finalist (…
☆11 · Apr 29, 2024 · Updated 2 years ago