inclusionAI / LLaDA2.XLinks

LLaDA2.0 is the diffusion language model series developed by InclusionAI team, Ant Group.

☆319

Alternatives and similar repositories for LLaDA2.X

Users that are interested in LLaDA2.X are comparing it to the libraries listed below

Sorting:

Gen-Verse / dLLM-RL
[ICLR 2026] TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models
☆423Updated 2 weeks ago
JinjieNi / dlms-are-super-data-learners
The official github repo for "Diffusion Language Models are Super Data Learners".
☆221Updated 3 months ago
JinjieNi / MegaDLMs
GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 tr…
☆322Updated 3 months ago
inclusionAI / dFactory
Easy and Efficient dLLM Fine-Tuning
☆209Updated 3 weeks ago
idanshen / Self-Distillation
☆236Updated last week
HKUNLP / DiffuLLaMA
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
☆362Updated 8 months ago
aakaran / reasoning-with-sampling
☆388Updated 3 months ago
Infini-AI-Lab / Multiverse
☆111Updated 5 months ago
pengzhangzhi / Open-dLLM
The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
☆519Updated 3 months ago
tokenbender / mHC-manifold-constrained-hyper-connections
implementations and experimentation on mHC by deepseek - https://arxiv.org/abs/2512.24880
☆290Updated this week
s-sahoo / Eso-LMs
Esoteric Language Models
☆111Updated this week
hanyang1999 / discrete-diffusion-papers
A collection of papers on discrete diffusion models
☆168Updated 7 months ago
dllm-reasoning / d1
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
☆402Updated 2 weeks ago
lasgroup / SDPO
Reinforcement Learning via Self-Distillation (SDPO)
☆285Updated last week
ZihanWang314 / CoE
Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models
☆228Updated 3 months ago
callsys / GMPO
[ICLR 2026] Geometric-Mean Policy Optimization
☆100Updated 2 weeks ago
sail-sg / Precision-RL
Defeating the Training-Inference Mismatch via FP16
☆182Updated 2 months ago
Unakar / Spectral-Sphere-Optimizer
Spectral Sphere Optimizer
☆96Updated 3 weeks ago
horseee / dKV-Cache
[NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models
☆129Updated 8 months ago
ML-GSAI / SMDM
Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"
☆364Updated last year
thu-ml / ReMoE
[ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.
☆105Updated last year
LeapLabTHU / JustGRPO
Minimalist RL for Diffusion LLMs with SOTA reasoning performance (89.1% GSM8K). Official implementation of "The Flexibility Trap".
☆115Updated 2 weeks ago
SJTU-DENG-Lab / Discrete-Diffusion-Forcing
Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference
☆241Updated last week
yczhou001 / Awesome-Diffusion-LLM
paper list, tutorial, and nano code snippet for Diffusion Large Language Models.
☆154Updated 3 weeks ago
ypwang61 / One-Shot-RLVR
[NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example
☆408Updated 2 months ago
Xuekai-Zhu / FlowRL
☆154Updated 2 months ago
GMLR-Penn / Multiplex-Thinking
Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge
☆105Updated last week
eric-ai-lab / Soft-Thinking
Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"
☆311Updated 2 weeks ago
facebookresearch / PhysicsLM4
Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality
☆317Updated last month
PRIME-RL / RL-Compositionality
FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones
☆60Updated 2 weeks ago