ML-GSAI / SMDM

Official PyTorch implementation for the ICLR 2025 paper "Scaling up Masked Diffusion Models on Text"

☆267

Alternatives and similar repositories for SMDM

Users who are interested in SMDM compare it to the repositories listed below.

HKUNLP / DiffuLLaMA
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
☆253 Updated 2 months ago
dllm-reasoning / d1
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
☆255 Updated last month
kuleshov-group / mdlm
[NeurIPS 2024] Simple and Effective Masked Diffusion Language Model
☆466 Updated 2 months ago
HKUNLP / diffusion-of-thoughts
[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
☆172 Updated 5 months ago
bansky-cl / diffusion-nlp-paper-arxiv
Automatically fetches diffusion NLP papers from arXiv. More information on the papers can be found in another repository, "Diffusion-LM-Papers".
☆160 Updated this week
hanyang1999 / discrete-diffusion-papers
A collection of papers on discrete diffusion models
☆153 Updated last month
louaaron / Score-Entropy-Discrete-Diffusion
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
☆610 Updated last year
ML-GSAI / RADD
Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…
☆57 Updated 2 months ago
kuleshov-group / bd3lms
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
☆749 Updated 3 weeks ago
bansky-cl / Diffusion-LM-Papers
A list of diffusion papers in the NLP domain that I have read, mainly on text generation. The table will continue to be updated.
☆59 Updated 4 months ago
haonan3 / AnchorContext
AnchorAttention: Improved attention for LLMs long-context training
☆212 Updated 6 months ago
google-deepmind / md4
Official JAX implementation of MD4 Masked Diffusion Models
☆118 Updated 5 months ago
igul222 / plaid
☆104 Updated 2 years ago
HKUNLP / diffusion-vs-ar
[ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"
☆68 Updated 5 months ago
yczhou001 / Awesome-Diffusion-LLM
Paper list, tutorial, and nano code snippets for Diffusion Large Language Models.
☆96 Updated last month
facebookresearch / PhysicsLM4
Physics of Language Models, Part 4
☆204 Updated last week
jzhang38 / LongMamba
Some preliminary explorations of Mamba's context scaling.
☆216 Updated last year
dvruette / gidd
Code accompanying the paper "Generalized Interpolating Discrete Diffusion"
☆97 Updated last month
kuleshov-group / awesome-discrete-diffusion-models
A curated list for awesome discrete diffusion models resources.
☆416 Updated 2 months ago
multimodal-art-projection / LatentCoT-Horizon
📖 This is a repository for organizing papers, code, and other resources related to Latent Reasoning.
☆171 Updated last week
NVlabs / Fast-dLLM
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
☆320 Updated this week
MinkaiXu / Energy-Diffusion-LLM
Reproduction of the ICLR 2025 paper "Energy-Based Diffusion Language Models for Text Generation"
☆24 Updated 2 weeks ago
eric-ai-lab / Soft-Thinking
Official implementation of the paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"
☆200 Updated last week
jxiw / MambaInLlama
[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models
☆225 Updated 3 months ago
DreamLM / Dream
Dream 7B, a large diffusion language model
☆873 Updated last month
maple-research-lab / LLaDOU
Large Language Diffusion with Ordered Unmasking
☆44 Updated last week
lucidrains / coconut-pytorch
Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch
☆178 Updated last month
ruixin31 / Spurious_Rewards
☆322 Updated last week
ypwang61 / One-Shot-RLVR
Official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”
☆337 Updated last week
liusulin / DDPD
Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025]
☆70 Updated 3 months ago