huggingface/Megatron-LM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/huggingface/Megatron-LM)

huggingface / Megatron-LM

Ongoing research training transformer models at scale

☆19

Alternatives and similar repositories for Megatron-LM

Users that are interested in Megatron-LM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

huggingface / model-evaluator
View on GitHub
Evaluate Transformers from the Hub 🔥
☆14May 26, 2026Updated last month
huggingface / neuralcoref-viz
View on GitHub
✨ Web interface for NeuralCoref coreference resolution
☆35May 15, 2023Updated 3 years ago
OSU-NLP-Group / Explorer
View on GitHub
[ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
☆29Feb 17, 2026Updated 4 months ago
microsoft / Annotation-Factory
View on GitHub
Simple Python package for converting between CustomVision <-> Pascal VOC <-> YOLO annotations
☆19Nov 28, 2022Updated 3 years ago
L1aoXingyu / llm-infer-bench
View on GitHub
☆12Sep 1, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
huggingface / hffs
View on GitHub
**ARCHIVED** Filesystem interface to 🤗 Hub
☆60Apr 6, 2023Updated 3 years ago
yizhilll / CIF-Bench
View on GitHub
☆18Feb 29, 2024Updated 2 years ago
lewangdev / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆13Jul 15, 2024Updated last year
jiazhihao / attention_superoptimizer
View on GitHub
An Attention Superoptimizer
☆22Jan 20, 2025Updated last year
huggingface / ml-agents
View on GitHub
Unity Machine Learning Agents Toolkit
☆49Jun 8, 2023Updated 3 years ago
Mgrsc / Re-Mark
View on GitHub
一个基于原生浏览器书签的知识库：用 GitHub Gist 跨浏览器同步书签，并用 AI 为书签生成摘要、标签和封面，提供一个简洁的 Web 端浏览体验。
☆34May 25, 2026Updated last month
PhilippeMorere / EMU-Q
View on GitHub
Exploring by Minimizing Uncertainty of Q values (EMU-Q) as presented in "Bayesian RL for Goal-Only Rewards" at CoRL'18.
☆10Nov 8, 2018Updated 7 years ago
lucidrains / esbn-transformer
View on GitHub
An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols
☆16Aug 3, 2021Updated 4 years ago
cambridgeltl / composable-sft
View on GitHub
A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.
☆75Aug 9, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
abap34 / medCon2021-1st-place-solution
View on GitHub
1st place solution of 🦾😢 in https://www.kaggle.com/c/ai-medical-contest-2021/
☆10Apr 2, 2021Updated 5 years ago
huhailinguist / ChineseNLIProbing
View on GitHub
☆10Oct 17, 2021Updated 4 years ago
dsl-learn / triton-tutorial
View on GitHub
Getting Started with Triton: A Tutorial for Python Beginners
☆60Mar 26, 2026Updated 3 months ago
RLsys-Foundation / APRIL
View on GitHub
APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM tra…
☆60Oct 11, 2025Updated 9 months ago
chenjie97 / SimBert_PyTorch
View on GitHub
☆32Jul 5, 2021Updated 5 years ago
matcom / autoexam
View on GitHub
A simple exam generator and grader written in Python with OpenCV
☆14Jan 14, 2026Updated 5 months ago
SKRohit / Improving-YOLOv3
View on GitHub
Few training heuristics and small architectural changes that can significantly improve YOLOv3 performance with tiny increase in inference…
☆13May 10, 2020Updated 6 years ago
OoDBag / VisTA
View on GitHub
VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
☆27May 31, 2025Updated last year
huggingface / workshops
View on GitHub
Materials for workshops on the Hugging Face ecosystem
☆151May 26, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
yarikama / Agentic-Advanced-RAG
View on GitHub
Building a multi-agent RAG system with advanced RAG methods
☆13Jan 12, 2025Updated last year
HorizonRobotics / nuplan-devkit
View on GitHub
☆13Feb 5, 2025Updated last year
pengzhendong / asr-decoder
View on GitHub
CTC decoder with hotwords for ASR.
☆36Jun 15, 2026Updated 3 weeks ago
alexjc / nanogpt-speedrun
View on GitHub
NanoGPT (124M) in 5 minutes
☆15Feb 14, 2025Updated last year
CJackHwang / RadioNowhere
View on GitHub
Agent驱动的实时广播电台实验性项目
☆38Feb 8, 2026Updated 5 months ago
stepbuystep / LightNAS
View on GitHub
You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms
☆12Apr 17, 2023Updated 3 years ago
apalle1 / Sentiment-Span-Extraction-Using-Transformer-Models
View on GitHub
PyTorch - Albert Large V2, Bert Base Uncased, Bert Large Uncased WWM Finetuned Squad, Distil Roberta Base, Roberta Base Squad2, Roberta l…
☆11Jul 10, 2020Updated 6 years ago
adambarbato / ComfyUI-Segmentation-Agent
View on GitHub
LLM-based character segmentation agent for ComfyUI based on SAM 3 and the SAM 3 Agent notebook
☆28Dec 22, 2025Updated 6 months ago
billmuch / matmul_perf_test
View on GitHub
☆15Apr 15, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
thunlp / Model_Emotion
View on GitHub
Neuron Activation
☆28Nov 21, 2024Updated last year
havenpersona / lycon
View on GitHub
Copyright-free Artificial Lyrics Dataset (ISMIR 2024 LBD)
☆12Sep 1, 2024Updated last year
DataXujing / Bert_TensorRT
View on GitHub
Bert TensorRT模型加速部署
☆10Apr 1, 2022Updated 4 years ago
M-e-r-c-u-r-y / pytorch-transformers
View on GitHub
Collection of different types of transformers for learning purposes
☆12Jan 30, 2020Updated 6 years ago
hyc2026 / M3-Agent-Training
View on GitHub
☆29Mar 30, 2026Updated 3 months ago
AICourseTeamHatchet / VrepMotionPlanning
View on GitHub
Using Vrep to simulate a six-legged robot to do motion planning & path planning
☆10Jan 10, 2019Updated 7 years ago
xieh97 / dcase2023-audio-retrieval
View on GitHub
Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge
☆10Aug 8, 2023Updated 2 years ago