KellerJordan / top-sgd

Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)
12 · Updated last year

Related projects

Alternatives and complementary repositories for top-sgd