guotong1988 / BERT-GPU

multi-gpu pre-training in one machine for BERT from scratch without horovod (Data Parallelism)
172Updated 3 months ago

Alternatives and similar repositories for BERT-GPU:

Users that are interested in BERT-GPU are comparing it to the libraries listed below