KellerJordan / Muon

Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead
210 · Updated last week

Alternatives and similar repositories for Muon:

Users who are interested in Muon are comparing it to the libraries listed below.