Gen-Verse/dLLM-RL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Gen-Verse/dLLM-RL)

Gen-Verse / dLLM-RL

[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.

☆512

Alternatives and similar repositories for dLLM-RL

Users that are interested in dLLM-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JetAstra / SDAR
View on GitHub
SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model（1.7B, 4B, 8B, 30B）
☆361Jun 2, 2026Updated last month
OpenMOSS / DiRL
View on GitHub
☆165Mar 30, 2026Updated 3 months ago
dllm-reasoning / d1
View on GitHub
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
☆453Jan 26, 2026Updated 5 months ago
inclusionAI / dFactory
View on GitHub
Easy and Efficient dLLM Fine-Tuning
☆261Mar 2, 2026Updated 4 months ago
NVlabs / Fast-dLLM
View on GitHub
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
☆1,063May 30, 2026Updated last month
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
pengzhangzhi / Open-dLLM
View on GitHub
Open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
☆645Updated this week
Gen-Verse / MMaDA
View on GitHub
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)
☆1,660Feb 14, 2026Updated 5 months ago
inclusionAI / dInfer
View on GitHub
dInfer: An Efficient Inference Framework for Diffusion Language Models
☆475Feb 11, 2026Updated 5 months ago
SJTU-DENG-Lab / Discrete-Diffusion-Forcing
View on GitHub
Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference
☆261Feb 3, 2026Updated 5 months ago
facebookresearch / SPG
View on GitHub
Code for paper "SPG Sandwiched Policy Gradient for Masked Diffusion Language Models"
☆62Oct 29, 2025Updated 8 months ago
maple-research-lab / LLaDOU
View on GitHub
Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025]
☆82Dec 17, 2025Updated 7 months ago
ZHZisZZ / dllm
View on GitHub
dLLM: Simple Diffusion Language Modeling
☆2,651Updated this week
ML-GSAI / LLaDA
View on GitHub
Official PyTorch implementation for "Large Language Diffusion Models"
☆3,907Jul 15, 2026Updated last week
hao-ai-lab / d3LLM
View on GitHub
[ICML 2026] d3LLM: Ultra-Fast Diffusion LLM 🚀
☆147May 1, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
czg1225 / dParallel
View on GitHub
[ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs
☆65Apr 12, 2026Updated 3 months ago
DreamLM / Dream
View on GitHub
Dream 7B, a large diffusion language model
☆1,254Nov 21, 2025Updated 8 months ago
VILA-Lab / Awesome-DLMs
View on GitHub
The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".
☆1,150May 29, 2026Updated last month
LeapLabTHU / JustGRPO
View on GitHub
[ICML 2026 Outstanding Paper] Minimalist RL for Diffusion LLMs. 89.1% on GSM8K.
☆243Jul 6, 2026Updated 2 weeks ago
jacklishufan / LaViDa
View on GitHub
Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding
☆227Dec 17, 2025Updated 7 months ago
ML-GSAI / LLaDA-V
View on GitHub
☆347Mar 23, 2026Updated 3 months ago
Labman42 / JetEngine
View on GitHub
A lightweight Inference Engine built for block diffusion models
☆47Apr 12, 2026Updated 3 months ago
JinjieNi / MegaDLMs
View on GitHub
GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 tr…
☆343Nov 11, 2025Updated 8 months ago
yjyddq / DARE
View on GitHub
Official repository of DARE: Diffusion Large Language Models Alignment and Reinforcement Executor
☆213Updated this week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ML-GSAI / ESPO
View on GitHub
Official PyTorch implementation for "Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective"
☆39Jan 25, 2026Updated 5 months ago
JinjieNi / dlms-are-super-data-learners
View on GitHub
The official github repo for "Diffusion Language Models are Super Data Learners".
☆227Nov 6, 2025Updated 8 months ago
HKUNLP / DiffuLLaMA
View on GitHub
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
☆400May 31, 2025Updated last year
cychomatica / FreeDave
View on GitHub
Free Draft-and-Verification: Toward Lossless Parallel Decoding for Diffusion Large Language Models
☆23May 19, 2026Updated 2 months ago
maomaocun / dLLM-Var
View on GitHub
The official implementation of dLLM-Var
☆35Nov 6, 2025Updated 8 months ago
martian422 / MaskGRPO
View on GitHub
The official implementation of MaskGRPO: Consolidating Reinforcement Learning for Multimodal Discrete Diffusion Models. (ICLR 2026, arxiv…
☆19Jan 27, 2026Updated 5 months ago
inclusionAI / LLaDA2.X
View on GitHub
LLaDA2.0 is the diffusion language model series developed by InclusionAI team, Ant Group.
☆444Feb 12, 2026Updated 5 months ago
maomaocun / dLLM-cache
View on GitHub
Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…
☆211May 1, 2026Updated 2 months ago
OpenGVLab / SDLM
View on GitHub
Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt…
☆98Dec 27, 2025Updated 6 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
autonomousvision / mdpo
View on GitHub
MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models
☆45Jan 28, 2026Updated 5 months ago
apple / ml-diffucoder
View on GitHub
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
☆830Jul 9, 2025Updated last year
Li-Jinsong / DAEDAL
View on GitHub
[ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"
☆173Feb 16, 2026Updated 5 months ago
kuleshov-group / bd3lms
View on GitHub
[ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
☆1,021Jul 10, 2025Updated last year
Auraithm / LLADA_pretraining
View on GitHub
☆31Aug 18, 2025Updated 11 months ago
SJTU-DENG-Lab / Diffulex
View on GitHub
Flexible and Pluggable Serving Engine for Diffusion LLMs
☆147Jul 13, 2026Updated last week
yu-rp / Dimple
View on GitHub
Dimple, the first Discrete Diffusion Multimodal Large Language Model
☆117Jul 9, 2025Updated last year