ML-GSAI / Diffusion-LLM-Papers
A Collection of Papers on Diffusion Language Models
☆131 Updated 3 weeks ago
Alternatives and similar repositories for Diffusion-LLM-Papers
Users who are interested in Diffusion-LLM-Papers are comparing it to the repositories listed below
- ☆237 Updated 2 weeks ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models. ☆116 Updated 3 months ago
- Doodling our way to AGI ✏️ 🖼️ 🧠 ☆105 Updated 4 months ago
- ☆60 Updated 4 months ago
- ✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints ☆75 Updated 2 months ago
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca… ☆25 Updated this week
- Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning ☆212 Updated 5 months ago
- A collection of papers on discrete diffusion models ☆162 Updated 3 months ago
- Paper List of Inference/Test Time Scaling/Computing ☆308 Updated last month
- The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints. ☆281 Updated 3 weeks ago
- TraceRL: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models ☆226 Updated last week
- [EMNLP 2024 Findings🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In… ☆101 Updated 10 months ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task ☆36 Updated 5 months ago
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs. ☆71 Updated 8 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI ☆182 Updated 3 weeks ago
- Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025) ☆168 Updated 2 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25] ☆151 Updated 4 months ago
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation ☆91 Updated 2 weeks ago
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache… ☆157 Updated 3 weeks ago
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains ☆51 Updated 2 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models ☆103 Updated 4 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆86 Updated 7 months ago
- Dimple, the first Discrete Diffusion Multimodal Large Language Model ☆100 Updated 2 months ago
- ☆52 Updated last month
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models. ☆301 Updated last week
- Code release for VTW (AAAI 2025 Oral) ☆50 Updated 2 months ago
- One-shot Entropy Minimization ☆185 Updated 3 months ago
- LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models ☆149 Updated last week
- A Massive Multi-Discipline Lecture Understanding Benchmark ☆30 Updated 3 months ago
- Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" ☆49 Updated 2 months ago