kuleshov-group / remdm
Remasking Discrete Diffusion Models with Inference-Time Scaling
☆66 · Feb 7, 2026 · Updated last week
Alternatives and similar repositories for remdm
Users who are interested in remdm are comparing it to the repositories listed below.
- [NeurIPS 2024] Simple and Effective Masked Diffusion Language Model ☆619 · Sep 29, 2025 · Updated 4 months ago
- Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025] ☆73 · Dec 17, 2025 · Updated last month
- Simple Guidance Mechanisms for Discrete Diffusion Models ☆71 · Dec 16, 2024 · Updated last year
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning" ☆404 · Jan 26, 2026 · Updated 3 weeks ago
- Official Code for "Rethinking Diffusion Model in High Dimension" ☆24 · May 20, 2025 · Updated 8 months ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia… ☆29 · Jul 24, 2025 · Updated 6 months ago
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025] ☆84 · Apr 24, 2025 · Updated 9 months ago
- ☆12 · Oct 7, 2024 · Updated last year
- Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods" ☆31 · May 19, 2025 · Updated 8 months ago
- A curated list for awesome discrete diffusion models resources. ☆533 · Sep 9, 2025 · Updated 5 months ago
- ☆16 · Jun 10, 2025 · Updated 8 months ago
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations ☆16 · Oct 18, 2025 · Updated 3 months ago
- The official implementation of "Smoothie: Smoothing Diffusion on Token Embeddings for Text Generation" ☆24 · Feb 2, 2026 · Updated last week
- [ICLR 2025] SDTT: a simple and effective distillation method for discrete diffusion models ☆47 · Sep 10, 2025 · Updated 5 months ago
- ☆16 · Dec 23, 2023 · Updated 2 years ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion ☆14 · Mar 17, 2025 · Updated 10 months ago
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs ☆52 · Dec 7, 2025 · Updated 2 months ago
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models ☆365 · May 31, 2025 · Updated 8 months ago
- Easy and Efficient dLLM Fine-Tuning ☆212 · Jan 21, 2026 · Updated 3 weeks ago
- ☆35 · May 16, 2025 · Updated 9 months ago
- ☆18 · Oct 14, 2024 · Updated last year
- ☆21 · Jul 25, 2025 · Updated 6 months ago
- Improving large language models with concept-aware fine-tuning (CAFT) ☆29 · Jan 31, 2026 · Updated 2 weeks ago
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR… ☆79 · May 30, 2025 · Updated 8 months ago
- [ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models" ☆162 · Sep 12, 2025 · Updated 5 months ago
- ☆26 · Aug 21, 2025 · Updated 5 months ago
- [ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models ☆956 · Jul 10, 2025 · Updated 7 months ago
- Reproduce ICLR2025 Energy-Based Diffusion Language Models for Text Generation ☆61 · Jul 22, 2025 · Updated 6 months ago
- Code for the paper "Cottention: Linear Transformers With Cosine Attention" ☆20 · Nov 15, 2025 · Updated 3 months ago
- [ICLR'25] Code for KaSA, an official implementation of "KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models" ☆20 · Jan 16, 2025 · Updated last year
- ☆20 · May 7, 2025 · Updated 9 months ago
- Official repository of DialSim ☆28 · Oct 31, 2025 · Updated 3 months ago
- ☆28 · Apr 22, 2025 · Updated 9 months ago
- R3: Robust Rubric-Agnostic Reward Models ☆20 · Jul 12, 2025 · Updated 7 months ago
- [ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834) ☆701 · Feb 29, 2024 · Updated last year
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model" ☆54 · Feb 20, 2025 · Updated 11 months ago
- Official Code Repository for the paper "Continuous Diffusion Model for Language Modeling" (NeurIPS 2025). ☆60 · Sep 25, 2025 · Updated 4 months ago
- Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging dif… ☆28 · Jan 21, 2025 · Updated last year
- MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models ☆40 · Jan 28, 2026 · Updated 2 weeks ago