leloykun / adaptive-muon
View external linksLinks

A single-line modification to any (dualizer-based) optimizer that allows the optimizer to adapt to the scale of the gradients as they change during training
19Jan 11, 2025Updated last year

Alternatives and similar repositories for adaptive-muon

Users that are interested in adaptive-muon are comparing it to the libraries listed below

Sorting:

Are these results useful?