Liuhong99 / Sophia

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”
949Updated last year

Alternatives and similar repositories for Sophia:

Users that are interested in Sophia are comparing it to the libraries listed below