alibaba / Megatron-LLaMA

Best practice for training LLaMA models in Megatron-LM
650Updated last year

Alternatives and similar repositories for Megatron-LLaMA

Users that are interested in Megatron-LLaMA are comparing it to the libraries listed below

Sorting: