KellerJordan / top-sgd

Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)
12Updated last year

Alternatives and similar repositories for top-sgd:

Users that are interested in top-sgd are comparing it to the libraries listed below