inclusionAI / LLaDA2.0Links

LLaDA2.0 is the diffusion language model series developed by InclusionAI team, Ant Group.

☆240

Alternatives and similar repositories for LLaDA2.0

Users that are interested in LLaDA2.0 are comparing it to the libraries listed below

Sorting:

Gen-Verse / dLLM-RL
[ICLR 2026] TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models
☆423Updated last week
tokenbender / mHC-manifold-constrained-hyper-connections
implementations and experimentation on mHC by deepseek - https://arxiv.org/abs/2512.24880
☆290Updated this week
inclusionAI / dFactory
Easy and Efficient dLLM Fine-Tuning
☆209Updated 2 weeks ago
HKUNLP / DiffuLLaMA
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
☆362Updated 8 months ago
JinjieNi / dlms-are-super-data-learners
The official github repo for "Diffusion Language Models are Super Data Learners".
☆220Updated 3 months ago
idanshen / Self-Distillation
☆236Updated last week
pengzhangzhi / Open-dLLM
The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
☆510Updated 2 months ago
horseee / dKV-Cache
[NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models
☆129Updated 8 months ago
hanyang1999 / discrete-diffusion-papers
A collection of papers on discrete diffusion models
☆168Updated 7 months ago
callsys / GMPO
[ICLR 2026] Geometric-Mean Policy Optimization
☆99Updated 2 weeks ago
Infini-AI-Lab / Multiverse
☆111Updated 4 months ago
yczhou001 / Awesome-Diffusion-LLM
paper list, tutorial, and nano code snippet for Diffusion Large Language Models.
☆153Updated 3 weeks ago
dllm-reasoning / d1
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
☆402Updated 2 weeks ago
sail-sg / Precision-RL
Defeating the Training-Inference Mismatch via FP16
☆181Updated 2 months ago
OpenMOSS / DiRL
☆142Updated 3 weeks ago
s-sahoo / Eso-LMs
Esoteric Language Models
☆111Updated this week
DreamLM / DreamOn
Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas
☆99Updated last week
thu-ml / ReMoE
[ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.
☆105Updated last year
JinjieNi / MegaDLMs
GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 tr…
☆321Updated 2 months ago
SJTU-DENG-Lab / Discrete-Diffusion-Forcing
Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference
☆241Updated last week
ML-GSAI / SMDM
Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"
☆364Updated last year
wmn-231314 / diffusion-data-constraint
Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…
☆120Updated last month
TsinghuaC3I / Fourier-Position-Embedding
[ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization
☆108Updated 8 months ago
aakaran / reasoning-with-sampling
☆388Updated 3 months ago
lucidrains / coconut-pytorch
Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch
☆182Updated 7 months ago
GMLR-Penn / Multiplex-Thinking
Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge
☆104Updated last week
jacklishufan / LaViDa
Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding
☆194Updated last month
ML-GSAI / LLaDA-1.5
☆55Updated 8 months ago
LeapLabTHU / JustGRPO
Minimalist RL for Diffusion LLMs with SOTA reasoning performance (89.1% GSM8K). Official implementation of "The Flexibility Trap".
☆115Updated 2 weeks ago
ML-GSAI / LLaDA-V
☆318Updated last month