facebookresearch/SPG

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/SPG)

facebookresearch / SPG

Code for paper "SPG Sandwiched Policy Gradient for Masked Diffusion Language Models"

☆63

Alternatives and similar repositories for SPG

Users that are interested in SPG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yuchen-zhu-zyc / DMPO
View on GitHub
[ICML 2026 Spotlight] Enhancing Reasoning For Diffusion LLMs via Distribution Matching Policy Optimization
☆19Jun 16, 2026Updated last month
dllm-reasoning / d1
View on GitHub
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
☆453Jan 26, 2026Updated 5 months ago
zhouc20 / HDLM
View on GitHub
Official Repository for NeurIPS 2025 Paper: Next Semantic Scale Prediction via Hierarchical Diffusion Language Models
☆35Oct 13, 2025Updated 9 months ago
Gen-Verse / dLLM-RL
View on GitHub
[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.
☆511Jan 28, 2026Updated 5 months ago
maple-research-lab / LLaDOU
View on GitHub
Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025]
☆82Dec 17, 2025Updated 7 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ChenyuWang-Monica / DRAKES
View on GitHub
Code for paper: "Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design"
☆75May 10, 2025Updated last year
Whiterrrrr / BREEZE
View on GitHub
[NeurIPS 2025] The official implementation of "Towards Robust Zero-Shot Reinforcement Learning"
☆15Jan 2, 2026Updated 6 months ago
inclusionAI / dFactory
View on GitHub
Easy and Efficient dLLM Fine-Tuning
☆261Mar 2, 2026Updated 4 months ago
JianyuanZhong / StableDRL
View on GitHub
☆15Updated this week
kuleshov-group / setdlms
View on GitHub
[ICML 2026] Set Diffusion: Interpolating Token Orderings between Autoregression and Diffusion for Fast and Flexible Decoding
☆21Updated this week
LeapLabTHU / JustGRPO
View on GitHub
[ICML 2026 Outstanding Paper] Minimalist RL for Diffusion LLMs. 89.1% on GSM8K.
☆246Jul 6, 2026Updated 2 weeks ago
ML-GSAI / LLaDA-V
View on GitHub
☆347Mar 23, 2026Updated 4 months ago
hao-ai-lab / d3LLM
View on GitHub
[ICML 2026] d3LLM: Ultra-Fast Diffusion LLM 🚀
☆147May 1, 2026Updated 2 months ago
autonomousvision / mdpo
View on GitHub
MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models
☆45Jan 28, 2026Updated 5 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
cychomatica / FreeDave
View on GitHub
Free Draft-and-Verification: Toward Lossless Parallel Decoding for Diffusion Large Language Models
☆23May 19, 2026Updated 2 months ago
NVlabs / Fast-dLLM
View on GitHub
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
☆1,063May 30, 2026Updated last month
JetAstra / SDAR
View on GitHub
SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model（1.7B, 4B, 8B, 30B）
☆362Jun 2, 2026Updated last month
phymhan / S2D2
View on GitHub
☆16Jun 17, 2026Updated last month
apple / ml-rl-dllm
View on GitHub
Repository companioning the paper "Learning Unmasking Policies for Diffusion Language Models"
☆18Mar 30, 2026Updated 3 months ago
viiika / Prism
View on GitHub
[ICML 2026] Official Implementation of Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diff…
☆22Mar 4, 2026Updated 4 months ago
pengzhangzhi / Open-dLLM
View on GitHub
Open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
☆645Updated this week
kuleshov-group / e2d2
View on GitHub
[NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference
☆47Oct 29, 2025Updated 8 months ago
ims-kdks / Learning-to-Parallel-Decoding
View on GitHub
[ICLR 2026] Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding
☆34Jan 27, 2026Updated 5 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
nkalyanv99 / UNI-D2
View on GitHub
☆54Jul 8, 2026Updated 2 weeks ago
ML-GSAI / LLaDA-1.5
View on GitHub
☆55Apr 14, 2026Updated 3 months ago
maple-research-lab / RemeDi
View on GitHub
Official inference implementation of the paper "DON'T SETTLE TOO EARLY: SELF-REFLECTIVE REMASKING FOR DIFFUSION LANGUAGE MODELS". [ICLR 2…
☆15Jan 28, 2026Updated 5 months ago
maomaocun / dLLM-Var
View on GitHub
The official implementation of dLLM-Var
☆35Nov 6, 2025Updated 8 months ago
brianlck / FlexMDM
View on GitHub
☆55Sep 10, 2025Updated 10 months ago
Labman42 / JetEngine
View on GitHub
A lightweight Inference Engine built for block diffusion models
☆47Apr 12, 2026Updated 3 months ago
horseee / dKV-Cache
View on GitHub
[NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models
☆135May 22, 2025Updated last year
kuleshov-group / proseco
View on GitHub
Learn from Your Mistakes: Self-Correcting Masked Diffusion Models
☆15Jun 25, 2026Updated 3 weeks ago
Gen-Verse / MMaDA
View on GitHub
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)
☆1,660Feb 14, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
s-sahoo / Eso-LMs
View on GitHub
[ICML 2026] Esoteric Language Models
☆121Jul 13, 2026Updated last week
More2Search / Awesome-Search-LLM
View on GitHub
☆21Oct 17, 2025Updated 9 months ago
Multimedia-Semantic-Analytics-Lab / PerceptionDLM
View on GitHub
Official Repo For PerceptionDLM Codebase
☆77Jun 22, 2026Updated last month
AIDASLab / Awesome-Diffusion-LLM
View on GitHub
A comprehensive list of papers about Large-Language-Diffusion-Models.
☆91Jun 4, 2026Updated last month
t6-thu / H2Oplus
View on GitHub
[ICRA'25] H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
☆13Apr 10, 2025Updated last year
siyan-zhao / decision-stacks
View on GitHub
Implementation of Decision Stacks: Flexible RL via Modular Generative Models [NeurIPS 2023]
☆12Jun 27, 2023Updated 3 years ago
kuleshov-group / bd3lms
View on GitHub
[ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
☆1,023Jul 10, 2025Updated last year