alibaba / Megatron-LLaMA

Best practice for training LLaMA models in Megatron-LM
628Updated 10 months ago

Related projects

Alternatives and complementary repositories for Megatron-LLaMA