NVIDIA / Megatron-LM

Ongoing research training transformer models at scale
10,480 · Updated this week

Related projects

Alternatives and complementary repositories for Megatron-LM