Ongoing research training transformer models at scale
☆40 · May 15, 2026 · Updated last week
Alternatives and similar repositories for Megatron-LM
Users that are interested in Megatron-LM are comparing it to the libraries listed below.
- MAD (Model Automation and Dashboarding) · ☆36 · Updated this week
- ☆68 · May 16, 2026 · Updated last week
- ☆11 · Jun 29, 2021 · Updated 4 years ago
- The goal of the OSSCI Fleet is to provide a central mechanism to enable test automation, batch job scheduling, and developer access to a … · ☆13 · Apr 28, 2026 · Updated 3 weeks ago
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more · ☆26 · Updated this week
- ☆30 · Updated this week
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models · ☆22 · May 28, 2024 · Updated last year
- Fast and memory-efficient exact attention · ☆231 · Updated this week
- ☆12 · Mar 22, 2022 · Updated 4 years ago
- a small C++ lattice library · ☆15 · Jan 9, 2020 · Updated 6 years ago
- ☆61 · Sep 15, 2023 · Updated 2 years ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo · ☆91 · Updated this week
- A PyTorch native platform for training generative AI models · ☆17 · Apr 21, 2026 · Updated last month
- AI Tensor Engine for ROCm · ☆440 · Updated this week
- ☆24 · Oct 9, 2025 · Updated 7 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo · ☆113 · Updated this week
- ☆20 · Sep 8, 2025 · Updated 8 months ago
- A high-performance acceleration library dedicated to large-scale model training on AMD GPUs · ☆64 · Updated this week
- R package for unleashing the power of NVIDIA GPUs · ☆16 · Jun 4, 2016 · Updated 9 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror · ☆532 · May 15, 2026 · Updated last week
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training · ☆18 · Dec 19, 2024 · Updated last year
- Kubernetes operator which sets up all platform tools to have a cluster ready for applications to run. · ☆18 · Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo · ☆418 · Updated this week
- ☆29 · Apr 4, 2024 · Updated 2 years ago
- Do not use averages with Likert scale data · ☆12 · Jul 26, 2017 · Updated 8 years ago
- ☆77 · Jun 20, 2025 · Updated 11 months ago
- python package of rocm-smi-lib · ☆25 · Dec 15, 2025 · Updated 5 months ago
- Random collections of my interested research papers / projects · ☆20 · May 20, 2021 · Updated 5 years ago
- Official implementation for the paper "Understanding Hyperdimensional Computing for Parallel Single-Pass Learning" · ☆25 · Jun 10, 2023 · Updated 2 years ago
- R Bindings for the UCR Suite for fast time series subsequence search · ☆17 · Mar 7, 2026 · Updated 2 months ago
- AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming · ☆188 · Updated this week
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation · ☆17 · Sep 2, 2024 · Updated last year
- Microsoft Collective Communication Library · ☆66 · Nov 23, 2024 · Updated last year
- ☆167 · May 15, 2026 · Updated last week
- ☆11 · Jun 24, 2021 · Updated 4 years ago
- ☆16 · Apr 30, 2026 · Updated 3 weeks ago
- ☆20 · Sep 16, 2023 · Updated 2 years ago
- Doctoral thesis, Abrahao MT, 2016 · ☆15 · Oct 2, 2018 · Updated 7 years ago
- run commands in a container environment without root · ☆11 · Nov 1, 2016 · Updated 9 years ago