kwai / Megatron-Kwai

[USENIX ATC '24] Accelerating the Training of Large Language Models using Efficient Activation Rematerialization and Optimal Hybrid Parallelism
45Updated 3 months ago

Related projects

Alternatives and complementary repositories for Megatron-Kwai