titu1994 / keras-LAMB-Optimizer

Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"
75 · Updated 5 years ago

Alternatives and similar repositories for keras-LAMB-Optimizer:

Users who are interested in keras-LAMB-Optimizer are comparing it to the libraries listed below.