NVIDIA / Megatron-LM

Ongoing research training transformer models at scale
10,595Updated this week

Related projects

Alternatives and complementary repositories for Megatron-LM