wmn-231314/diffusion-data-constraint

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wmn-231314/diffusion-data-constraint)

wmn-231314 / diffusion-data-constraint

Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion models are significantly more data-efficient than standard left to right autoregressive models, due to their ability to learn from different token orderings.

☆127

Alternatives and similar repositories for diffusion-data-constraint

Users that are interested in diffusion-data-constraint are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JinjieNi / dlms-are-super-data-learners
View on GitHub
The official github repo for "Diffusion Language Models are Super Data Learners".
☆227Nov 6, 2025Updated 8 months ago
thunlp / SparsingLaw
View on GitHub
The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".
☆32Nov 12, 2024Updated last year
kuleshov-group / mdlm
View on GitHub
[NeurIPS 2024] Simple and Effective Masked Diffusion Language Model
☆701Sep 29, 2025Updated 9 months ago
pixeli99 / MixLN
View on GitHub
[ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…
☆30Jul 24, 2025Updated 11 months ago
tyshiwo1 / Awesome-Visual-Tokenizer
View on GitHub
Awesome Visual Tokenizers/Autoencoders
☆20Nov 19, 2025Updated 8 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
alexanderswerdlow / unidisc
View on GitHub
UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…
☆142Apr 2, 2025Updated last year
ML-GSAI / SMDM
View on GitHub
Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"
☆384Dec 22, 2024Updated last year
Jiawei-Yang / DeTok
View on GitHub
Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"
☆195Feb 24, 2026Updated 4 months ago
zaydzuhri / token-order-prediction
View on GitHub
Landing repository for the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"
☆48May 13, 2026Updated 2 months ago
scxue / AO-GPT-MDM
View on GitHub
Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture. Training an MDM using GPT with this repo!
☆36Jun 23, 2025Updated last year
CompVis / tread
View on GitHub
☆182Jan 8, 2026Updated 6 months ago
microsoft / ArchScale
View on GitHub
Simple & Scalable Pretraining for Neural Architecture Research
☆337Mar 31, 2026Updated 3 months ago
Li-Jinsong / DAEDAL
View on GitHub
[ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"
☆173Feb 16, 2026Updated 5 months ago
Interplay-LM-Reasoning / Interplay-LM-Reasoning
View on GitHub
[ICML 2026 Spotlight] On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models
☆162Jun 8, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
End2End-Diffusion / diffusion-bench
View on GitHub
Towards Holistic evaluation of Generative Diffusion Transformers!
☆98Jul 1, 2026Updated 2 weeks ago
ChenyuWang-Monica / REED
View on GitHub
Code for paper: "Learning Diffusion Models with Flexible Representation Guidance"
☆16Mar 18, 2026Updated 4 months ago
nishadsinghi / sc-genrm-scaling
View on GitHub
[COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…
☆15Oct 31, 2025Updated 8 months ago
RadicalNumerics / RND1
View on GitHub
RND1: Scaling Diffusion Language Models
☆186Jun 17, 2026Updated last month
ExplainableML / HyperNoise
View on GitHub
☆70Dec 5, 2025Updated 7 months ago
wenquanlu / huginn-latent-cot
View on GitHub
[COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…
☆19Oct 4, 2025Updated 9 months ago
apple / ml-flextok
View on GitHub
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
☆322Jun 2, 2025Updated last year
kuleshov-group / bd3lms
View on GitHub
[ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
☆1,021Jul 10, 2025Updated last year
DreamLM / DreamOn
View on GitHub
Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas
☆118Feb 3, 2026Updated 5 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lukaslaobeyer / token-opt
View on GitHub
Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training"
☆205Jun 10, 2025Updated last year
rimads / avey-dpa
View on GitHub
Code for the paper Don't Pay Attention
☆59Sep 25, 2025Updated 9 months ago
DreamLM / Dream
View on GitHub
Dream 7B, a large diffusion language model
☆1,255Nov 21, 2025Updated 8 months ago
jacklishufan / LaViDa
View on GitHub
Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding
☆227Dec 17, 2025Updated 7 months ago
uq-project / UQ
View on GitHub
UQ: Assessing Language Models on Unsolved Questions
☆30Aug 26, 2025Updated 10 months ago
HazyResearch / scaling-verification
View on GitHub
☆26Sep 4, 2025Updated 10 months ago
ZitengWangNYU / Scale-RAE
View on GitHub
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
☆255Feb 13, 2026Updated 5 months ago
Auraithm / LLADA_pretraining
View on GitHub
☆31Aug 18, 2025Updated 11 months ago
berlino / seq_icl
View on GitHub
☆54May 20, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
HKUNLP / DiffuLLaMA
View on GitHub
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
☆400May 31, 2025Updated last year
Gen-Verse / MMaDA
View on GitHub
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)
☆1,660Feb 14, 2026Updated 5 months ago
locuslab / EqR
View on GitHub
[ICML 2026] Code for Equilibrium Reasoners: learning attractor dynamics for scalable reasoning
☆44Jun 1, 2026Updated last month
kuleshov-group / proseco
View on GitHub
Learn from Your Mistakes: Self-Correcting Masked Diffusion Models
☆15Jun 25, 2026Updated 3 weeks ago
PRIME-RL / RL-Compositionality
View on GitHub
FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones
☆68Jan 26, 2026Updated 5 months ago
allenai / signal-and-noise
View on GitHub
Measuring the Signal to Noise Ratio in Language Model Evaluation
☆31Aug 19, 2025Updated 11 months ago
SJTU-DENG-Lab / Discrete-Diffusion-Forcing
View on GitHub
Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference
☆261Feb 3, 2026Updated 5 months ago