Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache).
☆198Nov 17, 2025Updated 3 months ago
Alternatives and similar repositories for dLLM-cache
Users that are interested in dLLM-cache are comparing it to the libraries listed below
Sorting:
- Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ…☆40Jul 18, 2025Updated 7 months ago
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"☆852Jan 28, 2026Updated last month
- (ICLR 2026 🔥) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"☆74Feb 9, 2026Updated 3 weeks ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆24Mar 4, 2025Updated 11 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆130May 22, 2025Updated 9 months ago
- Official PyTorch implementation for "Large Language Diffusion Models"☆3,609Nov 12, 2025Updated 3 months ago
- The official implementation of dLLM-Var☆30Nov 6, 2025Updated 3 months ago
- [ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"☆162Feb 16, 2026Updated 2 weeks ago
- Compression for Foundation Models☆35Jul 21, 2025Updated 7 months ago
- Fast, memory-efficient attention column reduction (e.g., sum, mean, max)☆37Feb 10, 2026Updated 3 weeks ago
- [Arxiv] Discrete Diffusion in Large Language and Multimodal Models: A Survey☆368Nov 1, 2025Updated 4 months ago
- ☆325Dec 16, 2025Updated 2 months ago
- A Collection of Papers on Diffusion Language Models☆157Sep 15, 2025Updated 5 months ago
- 📚 Collection of awesome generation acceleration resources.☆388Jul 7, 2025Updated 7 months ago
- Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas☆103Feb 3, 2026Updated last month
- Data distillation benchmark☆72Jun 13, 2025Updated 8 months ago
- "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"☆51Jan 28, 2026Updated last month
- [EMNLP 2025 Main] SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning☆34Jan 11, 2026Updated last month
- ☆147Jan 20, 2026Updated last month
- The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".☆818Feb 3, 2026Updated 3 weeks ago
- Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"☆367Dec 22, 2024Updated last year
- [NeurIPS'25] SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning☆40Oct 14, 2025Updated 4 months ago
- Esoteric Language Models☆111Feb 8, 2026Updated 3 weeks ago
- This project is based on the [LTX-Video](https://github.com/Lightricks/LTX-Video) algorithm of the diffusers and optimized and accelerate…☆13Dec 31, 2024Updated last year
- NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading☆84Jun 16, 2025Updated 8 months ago
- InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)☆174Jul 10, 2024Updated last year
- [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.☆435Jan 28, 2026Updated last month
- A model serving framework for various research and production scenarios. Seamlessly built upon the PyTorch and HuggingFace ecosystem.☆23Oct 11, 2024Updated last year
- The Official Implementation of Ada-KV [NeurIPS 2025]☆128Nov 26, 2025Updated 3 months ago
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆15Oct 28, 2024Updated last year
- Packed Malware Analyzer (PACKMAN)☆12Jan 31, 2016Updated 10 years ago
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆410Jan 26, 2026Updated last month
- SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model(1.7B, 4B, 8B, 30B)☆333Dec 15, 2025Updated 2 months ago
- Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".☆272Feb 24, 2026Updated last week
- [NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference☆17Jun 19, 2025Updated 8 months ago
- [ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale …☆18Aug 17, 2025Updated 6 months ago
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better☆16Feb 15, 2025Updated last year
- MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)☆1,578Feb 14, 2026Updated 2 weeks ago
- [ACM MM 2025] TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos☆117Dec 12, 2025Updated 2 months ago