titu1994 / keras-LAMB-Optimizer

Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"
76 · Updated 5 years ago

Related projects

Alternatives and complementary repositories for keras-LAMB-Optimizer