bansky-cl / Diffusion-LM-PapersLinks

Listing some diffusion papers in NLP domain I have read, text generation is main, table will continue to be updated.

☆60

Alternatives and similar repositories for Diffusion-LM-Papers

Users that are interested in Diffusion-LM-Papers are comparing it to the libraries listed below

Sorting:

bansky-cl / diffusion-nlp-paper-arxiv
Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".
☆160Updated this week
justinlovelace / latent-diffusion-for-language
☆139Updated last year
ML-GSAI / SMDM
Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"
☆267Updated 7 months ago
HKUNLP / diffusion-of-thoughts
[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
☆173Updated 5 months ago
ML-GSAI / RADD
Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…
☆57Updated 2 months ago
igul222 / plaid
☆104Updated 2 years ago
hanyang1999 / discrete-diffusion-papers
A collection of papers on discrete diffusion models
☆156Updated last month
HKUNLP / DiffuLLaMA
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
☆259Updated 2 months ago
HKUNLP / reparam-discrete-diffusion
Reparameterized Discrete Diffusion Models for Text Generation
☆100Updated 2 years ago
zhjgao / difformer
The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024)
☆55Updated last year
yczhou001 / Awesome-Diffusion-LLM
paper list, tutorial, and nano code snippet for Diffusion Large Language Models.
☆96Updated last month
yegcjs / DiffusionLLM
Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"
☆83Updated last year
sail-sg / Attention-Sink
[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
☆107Updated last month
haonan3 / AnchorContext
AnchorAttention: Improved attention for LLMs long-context training
☆212Updated 6 months ago
kuleshov-group / mdlm
[NeurIPS 2024] Simple and Effective Masked Diffusion Language Model
☆466Updated 2 months ago
MinkaiXu / Energy-Diffusion-LLM
Reproduce ICLR2025 Energy-Based Diffusion Language Models for Text Generation
☆25Updated 2 weeks ago
TUDB-Labs / MoE-PEFT
An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT
☆108Updated 4 months ago
RUCBM / DeepCritic
Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"
☆32Updated last month
AoiDragon / Awesome-Text-Diffusion-Models
[IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".
☆31Updated last year
dllm-reasoning / d1
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
☆255Updated last month
DaShenZi721 / HRA
☆31Updated 2 months ago
liusulin / DDPD
Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025]
☆70Updated 3 months ago
ThreeSR / Awesome-Inference-Time-Scaling
Paper List of Inference/Test Time Scaling/Computing
☆286Updated last month
yegcjs / DINOISER
☆25Updated 3 weeks ago
yihedeng9 / rlhf-summary-notes
A brief and partial summary of RLHF algorithms.
☆131Updated 5 months ago
ML-GSAI / Scaling-Diffusion-Transformers-muP
Official implementation for our paper "Scaling Diffusion Transformers Efficiently via μP".
☆81Updated last month
ML-GSAI / LLaDA-1.5
☆31Updated 2 months ago
huaishengzhu / DSPO
☆23Updated 3 months ago
google-deepmind / md4
Official Jax Implementation of MD4 Masked Diffusion Models
☆118Updated 5 months ago
GATECH-EIC / ACT
[ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…
☆40Updated last year