leloykun / adaptive-muon

A single-line modification to any dualizer-based optimizer that lets the optimizer adapt to the scale of the gradients as that scale changes during training.
17 · Updated 10 months ago
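
The description above does not say what the single line is, so the following is only a minimal sketch of one plausible reading, not the repository's confirmed code. It assumes a "dualizer" is a map that orthogonalizes the gradient matrix (computed here with an exact SVD for clarity), and that the adaptive step rescales the dualized direction by its inner product with the raw gradient. The names `dualize` and `adaptive_update` are hypothetical.

```python
import numpy as np


def dualize(grad):
    """Assumed dualizer: set every singular value of grad to 1 (U @ Vt)."""
    u, _, vt = np.linalg.svd(grad, full_matrices=False)
    return u @ vt


def adaptive_update(grad):
    """Dualized direction rescaled by the gradient's size (hypothetical)."""
    direction = dualize(grad)
    # Assumed "single line": scale by <grad, direction>. For this dualizer
    # the inner product equals the nuclear norm of grad.
    return np.sum(grad * direction) * direction


rng = np.random.default_rng(0)
g = rng.standard_normal((4, 3))

# The plain dualized update ignores gradient scale; the adaptive one tracks it.
print(np.allclose(dualize(3.0 * g), dualize(g)))
print(np.allclose(adaptive_update(3.0 * g), 3.0 * adaptive_update(g)))
```

Under these assumptions, multiplying the gradient by a constant leaves the plain dualized update unchanged but multiplies the adaptive update by the same constant, which is one way an optimizer could follow the gradient scale as it drifts during training.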

Alternatives and similar repositories for adaptive-muon

Users who are interested in adaptive-muon compare it to the libraries listed below.
