kuleshov-group / mdlm

[NeurIPS 2024] Simple and Effective Masked Diffusion Language Model

☆466

Alternatives and similar repositories for mdlm

Users who are interested in mdlm are comparing it to the repositories listed below.

louaaron / Score-Entropy-Discrete-Diffusion
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
☆612 Updated last year
kuleshov-group / awesome-discrete-diffusion-models
A curated list of awesome discrete diffusion model resources.
☆416 Updated 2 months ago
ML-GSAI / SMDM
Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"
☆267 Updated 7 months ago
HKUNLP / DiffuLLaMA
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
☆259 Updated 2 months ago
google-deepmind / md4
Official JAX Implementation of MD4 Masked Diffusion Models
☆118 Updated 5 months ago
dllm-reasoning / d1
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
☆255 Updated last month
kuleshov-group / bd3lms
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
☆749 Updated 3 weeks ago
bansky-cl / diffusion-nlp-paper-arxiv
Automatically collects diffusion NLP papers from arXiv. More paper information can be found in another repository, "Diffusion-LM-Papers".
☆160 Updated this week
cloneofsimo / d3pm
Minimal Implementation of a D3PM in PyTorch
☆241 Updated last year
MinkaiXu / Energy-Diffusion-LLM
Reproduce ICLR2025 Energy-Based Diffusion Language Models for Text Generation
☆25 Updated 2 weeks ago
ML-GSAI / RADD
Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…
☆57 Updated 2 months ago
liusulin / DDPD
Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025]
☆70 Updated 3 months ago
zacharyhorvitz / Fk-Diffusion-Steering
A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.
☆174 Updated last month
igul222 / plaid
☆104 Updated 2 years ago
Haiyang-W / TokenFormer
[ICLR2025 Spotlight🔥] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
☆567 Updated 5 months ago
goombalab / hnet
H-Net: Hierarchical Network with Dynamic Chunking
☆632 Updated last week
bansky-cl / Diffusion-LM-Papers
A list of diffusion papers in the NLP domain that I have read, mainly on text generation; the table will continue to be updated.
☆60 Updated 4 months ago
justinlovelace / latent-diffusion-for-language
☆139 Updated last year
ozekri / SEPO
Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"
☆27 Updated 2 months ago
jzhang38 / LongMamba
Some preliminary explorations of Mamba's context scaling.
☆216 Updated last year
dvruette / gidd
Code accompanying the paper "Generalized Interpolating Discrete Diffusion"
☆97 Updated last month
lebellig / discrete-fm
Educational implementation of the Discrete Flow Matching paper
☆98 Updated 11 months ago
madaan / minimal-text-diffusion
A minimal implementation of diffusion models for text generation
☆389 Updated 2 years ago
lucidrains / nGPT-pytorch
Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI
☆289 Updated 2 months ago
minyoungg / platonic-rep
☆589 Updated 3 months ago
HKUNLP / diffusion-of-thoughts
[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
☆173 Updated 5 months ago
lucidrains / st-moe-pytorch
Implementation of ST-MoE, the latest incarnation of MoE after years of research at Brain, in PyTorch
☆353 Updated last year
kuleshov-group / discrete-diffusion-guidance
Simple Guidance Mechanisms for Discrete Diffusion Models
☆47 Updated 7 months ago
nnaisense / bayesian-flow-networks
This is the official code release for Bayesian Flow Networks.
☆290 Updated last year
HKUNLP / diffusion-vs-ar
[ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"
☆69 Updated 5 months ago