ProjectD-AI / LLaMA-Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2
69Updated last year

Related projects

Alternatives and complementary repositories for LLaMA-Megatron-DeepSpeed