ML-GSAI/LLaDA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ML-GSAI/LLaDA)

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

☆3,905

Alternatives and similar repositories for LLaDA

Users that are interested in LLaDA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DreamLM / Dream
View on GitHub
Dream 7B, a large diffusion language model
☆1,255Nov 21, 2025Updated 8 months ago
ML-GSAI / SMDM
View on GitHub
Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"
☆384Dec 22, 2024Updated last year
Gen-Verse / MMaDA
View on GitHub
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)
☆1,660Feb 14, 2026Updated 5 months ago
NVlabs / Fast-dLLM
View on GitHub
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
☆1,062May 30, 2026Updated last month
kuleshov-group / bd3lms
View on GitHub
[ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
☆1,021Jul 10, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
dllm-reasoning / d1
View on GitHub
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
☆454Jan 26, 2026Updated 5 months ago
HKUNLP / DiffuLLaMA
View on GitHub
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
☆400May 31, 2025Updated last year
ML-GSAI / LLaDA-V
View on GitHub
☆347Mar 23, 2026Updated 3 months ago
ZHZisZZ / dllm
View on GitHub
dLLM: Simple Diffusion Language Modeling
☆2,652Updated this week
kuleshov-group / mdlm
View on GitHub
[NeurIPS 2024] Simple and Effective Masked Diffusion Language Model
☆701Sep 29, 2025Updated 9 months ago
VILA-Lab / Awesome-DLMs
View on GitHub
The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".
☆1,148May 29, 2026Updated last month
pengzhangzhi / Open-dLLM
View on GitHub
Open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
☆645Updated this week
Gen-Verse / dLLM-RL
View on GitHub
[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.
☆511Jan 28, 2026Updated 5 months ago
maomaocun / dLLM-cache
View on GitHub
Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…
☆211May 1, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
JetAstra / SDAR
View on GitHub
SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model（1.7B, 4B, 8B, 30B）
☆361Jun 2, 2026Updated last month
inclusionAI / LLaDA2.X
View on GitHub
LLaDA2.0 is the diffusion language model series developed by InclusionAI team, Ant Group.
☆443Feb 12, 2026Updated 5 months ago
verl-project / verl
View on GitHub
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,587Updated this week
ML-GSAI / Diffusion-LLM-Papers
View on GitHub
A Collection of Papers on Diffusion Language Models
☆180Sep 15, 2025Updated 10 months ago
LiQiiiii / DLLM-Survey
View on GitHub
[Arxiv] Discrete Diffusion in Large Language and Multimodal Models: A Survey
☆387Apr 4, 2026Updated 3 months ago
jacklishufan / LaViDa
View on GitHub
Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding
☆227Dec 17, 2025Updated 7 months ago
SJTU-DENG-Lab / Discrete-Diffusion-Forcing
View on GitHub
Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference
☆261Feb 3, 2026Updated 5 months ago
ByteDance-Seed / Bagel
View on GitHub
Open-source unified multimodal model
☆6,106May 4, 2026Updated 2 months ago
apple / ml-diffucoder
View on GitHub
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
☆830Jul 9, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
inclusionAI / dInfer
View on GitHub
dInfer: An Efficient Inference Framework for Diffusion Language Models
☆474Feb 11, 2026Updated 5 months ago
louaaron / Score-Entropy-Discrete-Diffusion
View on GitHub
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
☆739Feb 29, 2024Updated 2 years ago
JiuhaiChen / BLIP3o
View on GitHub
Official implementation of BLIP3o-Series
☆1,663Nov 29, 2025Updated 7 months ago
showlab / Show-o
View on GitHub
[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.
☆1,963Jan 8, 2026Updated 6 months ago
fla-org / flash-linear-attention
View on GitHub
🚀 Efficient implementations for emerging model architectures
☆5,388Updated this week
facebookresearch / flow_matching
View on GitHub
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes…
☆4,625Jan 5, 2026Updated 6 months ago
FoundationVision / LlamaGen
View on GitHub
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,960Aug 15, 2024Updated last year
inclusionAI / dFactory
View on GitHub
Easy and Efficient dLLM Fine-Tuning
☆261Mar 2, 2026Updated 4 months ago
facebookresearch / DiT
View on GitHub
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆8,687May 31, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yifan123 / flow_grpo
View on GitHub
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
☆2,423May 7, 2026Updated 2 months ago
sihyun-yu / REPA
View on GitHub
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
☆1,679Mar 16, 2025Updated last year
baaivision / Emu3
View on GitHub
Next-Token Prediction is All You Need
☆2,432Jan 12, 2026Updated 6 months ago
yczhou001 / Awesome-Diffusion-LLM
View on GitHub
paper list, tutorial, and nano code snippet for Diffusion Large Language Models.
☆170Jan 19, 2026Updated 6 months ago
EleutherAI / lm-evaluation-harness
View on GitHub
A framework for few-shot evaluation of language models.
☆13,359Jul 13, 2026Updated last week
horseee / dKV-Cache
View on GitHub
[NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models
☆135May 22, 2025Updated last year
LTH14 / mar
View on GitHub
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
☆1,942Feb 20, 2026Updated 5 months ago