kwai / Megatron-Kwai

[USENIX ATC '24] Accelerating the Training of Large Language Models using Efficient Activation Rematerialization and Optimal Hybrid Parallelism
54 · Updated 10 months ago

Alternatives and similar repositories for Megatron-Kwai

Users who are interested in Megatron-Kwai compare it to the libraries listed below.
