xiaohangt/wd1

Official Implementation of wd1

☆32

Alternatives and similar repositories for wd1

Users who are interested in wd1 are comparing it to the repositories listed below.

ML-GSAI / ESPO
Official PyTorch implementation for "Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective"
☆39 · Jan 25, 2026 · Updated 5 months ago
maple-research-lab / LLaDOU
Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025]
☆82 · Dec 17, 2025 · Updated 7 months ago
autonomousvision / mdpo
MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models
☆45 · Jan 28, 2026 · Updated 5 months ago
dllm-reasoning / d1
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
☆454 · Jan 26, 2026 · Updated 5 months ago
kuleshov-group / remdm
Remasking Discrete Diffusion Models with Inference-Time Scaling
☆77 · Feb 7, 2026 · Updated 5 months ago
IBM / soft-masked-diffusion-language-models
Code for accepted paper at ICLR 2026
☆15 · May 19, 2026 · Updated 2 months ago
multimodal-art-projection / TreePO
☆65 · Mar 30, 2026 · Updated 3 months ago
JetAstra / SDAR
SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model (1.7B, 4B, 8B, 30B)
☆361 · Jun 2, 2026 · Updated last month
czg1225 / dParallel
[ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs
☆65 · Apr 12, 2026 · Updated 3 months ago
LeapLabTHU / JustGRPO
[ICML 2026 Outstanding Paper] Minimalist RL for Diffusion LLMs. 89.1% on GSM8K.
☆230 · Jul 6, 2026 · Updated 2 weeks ago
AndreHe02 / rewarding-unlikely-release
☆15 · Jun 10, 2025 · Updated last year
martian422 / MaskGRPO
The official implementation of MaskGRPO: Consolidating Reinforcement Learning for Multimodal Discrete Diffusion Models. (ICLR 2026, arxiv…
☆19 · Jan 27, 2026 · Updated 5 months ago
NEUIR / Uncode
[ACL '26] Source code for paper "Empirical Analysis of Decoding Biases in Masked Diffusion Models"
☆44 · Jun 26, 2026 · Updated 3 weeks ago
OpenMOSS / LongLLaDA
[AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs
☆55 · Dec 7, 2025 · Updated 7 months ago
Leey21 / A-Data-Centric-Study
☆18 · Mar 2, 2026 · Updated 4 months ago
inclusionAI / dFactory
Easy and Efficient dLLM Fine-Tuning
☆262 · Mar 2, 2026 · Updated 4 months ago
JianyuanZhong / StableDRL
☆15 · Updated this week
abdelfattah-lab / SplitReason
☆20 · Mar 18, 2026 · Updated 4 months ago
chinsengi / dUltra-os
dUltra: Ultra-Fast Diffusion Large Language Models via Reinforcement Learning
☆16 · Jul 11, 2026 · Updated last week
DreamLM / Dream
Dream 7B, a large diffusion language model
☆1,255 · Nov 21, 2025 · Updated 8 months ago
maple-research-lab / RemeDi
Official inference implementation of the paper "DON'T SETTLE TOO EARLY: SELF-REFLECTIVE REMASKING FOR DIFFUSION LANGUAGE MODELS". [ICLR 2…
☆15 · Jan 28, 2026 · Updated 5 months ago
pixeli99 / Prophet
Official implementation of "Diffusion Language Models Know the Answer Before Decoding"
☆60 · Apr 28, 2026 · Updated 2 months ago
ZhangXJ199 / EDGE-GRPO
Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity
☆22 · Aug 28, 2025 · Updated 10 months ago
whj363636 / Self-Ensemble-Adversarial-Training
SEAT
☆21 · Oct 10, 2023 · Updated 2 years ago
ByteDance-Seed / Stable-DiffCoder
Stable-DiffCoder is a family of lightweight open-source code DLLMs (diffusion large language models) comprising base and instruct models, …
☆83 · Mar 9, 2026 · Updated 4 months ago
THU-BPM / Watermarked_LLM_Identification
Code and data for paper "Can Watermarked LLMs be Identified by Users via Crafted Prompts?" Accepted by ICLR 2025 (Spotlight)
☆28 · Dec 28, 2024 · Updated last year
horseee / dKV-Cache
[NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models
☆135 · May 22, 2025 · Updated last year
uservan / speculative_thinking
☆34 · Oct 13, 2025 · Updated 9 months ago
wenquanlu / huginn-latent-cot
[COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…
☆19 · Oct 4, 2025 · Updated 9 months ago
Edmond1Cheng / MBDPO
Scaling World-Model Reinforcement Learning Through Diffusion Policy Optimization
☆16 · Jun 25, 2026 · Updated 3 weeks ago
shengliu66 / FractionalReason
Official GitHub repo for "Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute"
☆17 · Jun 30, 2025 · Updated last year
Relaxed-System-Lab / UltraLLaDA
We introduce UltraLLaDA, a scaled variant of LLaDA-8B-Base that extends the context length up to 128K tokens with light-weight post-trai…
☆15 · Oct 23, 2025 · Updated 8 months ago
amazon-science / doscond
☆19 · Jul 10, 2023 · Updated 3 years ago
aim-uofa / dLLM-MidTruth
[ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".
☆66 · Mar 5, 2026 · Updated 4 months ago
maomaocun / dLLM-cache
Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…
☆211 · May 1, 2026 · Updated 2 months ago
ArminAzizi98 / LaMDA
☆15 · Nov 7, 2024 · Updated last year
HKUNLP / DiffuLLaMA
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
☆400 · May 31, 2025 · Updated last year
MZhouke / RL-Scheduling
Code base for publication: Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems
☆10 · Feb 1, 2023 · Updated 3 years ago
PPPP-kaqiu / Awesome-Parallel-Reasoning
Awesome-Parallel-Reasoning: Unlocking the reasoning potential of LLMs. Papers, Code, Resources & Survey.
☆54 · Mar 8, 2026 · Updated 4 months ago