Ongoing research training transformer models at scale
☆15,985 · Apr 10, 2026 · Updated this week
Alternatives and similar repositories for Megatron-LM
Users who are interested in Megatron-LM are comparing it to the libraries listed below.
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆41,977 · Apr 3, 2026 · Updated last week
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆2,239 · Aug 14, 2025 · Updated 7 months ago
- Fast and memory-efficient exact attention ☆23,185 · Updated this week
- Transformer related optimization, including BERT, GPT ☆6,410 · Mar 27, 2024 · Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆75,637 · Updated this week
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on H… ☆3,256 · Apr 3, 2026 · Updated last week
- verl: Volcano Engine Reinforcement Learning for LLMs ☆20,443 · Apr 3, 2026 · Updated last week
- SGLang is a high-performance serving framework for large language models and multimodal models. ☆25,408 · Updated this week
- Development repository for the Triton language and compiler ☆18,840 · Apr 4, 2026 · Updated last week
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat… ☆13,304 · Updated this week
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆1,437 · Mar 20, 2024 · Updated 2 years ago
- Train transformer language models with reinforcement learning. ☆17,967 · Updated this week
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL) ☆9,315 · Updated this week
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto… ☆17,048 · Updated this week