alibaba / Megatron-LLaMA

Best practice for training LLaMA models in Megatron-LM
638 · Updated last year

Alternatives and similar repositories for Megatron-LLaMA:

Users who are interested in Megatron-LLaMA compare it to the libraries listed below.