FelixMessi / QDLMLinks
Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs
☆51Updated last week
Alternatives and similar repositories for QDLM
Users that are interested in QDLM are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] HoliTom: Holistic Token Merging for Fast Video Large Language Models☆70Updated 4 months ago
- A Collection of Papers on Diffusion Language Models☆155Updated 4 months ago
- Paper List of Inference/Test Time Scaling/Computing☆344Updated 5 months ago
- Code release for VTW (AAAI 2025 Oral)☆64Updated 3 months ago
- 📚 Collection of token-level model compression resources.☆190Updated 5 months ago
- [ICCV'25] The official code of paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"☆69Updated 3 weeks ago
- [EMNLP 2025 main 🔥] Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"☆103Updated 3 months ago
- [CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models☆99Updated 2 months ago
- The official implementation of dLLM-Var☆31Updated 3 months ago
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.☆84Updated 3 months ago
- [ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…☆70Updated 4 months ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆153Updated 3 weeks ago
- [NeurIPS 2025] Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.☆86Updated 4 months ago
- [NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.☆95Updated last year
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation☆236Updated 5 months ago
- [ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" and "Sp…☆237Updated last month
- ☆37Updated 5 months ago
- ☆142Updated 3 weeks ago
- Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model☆37Updated last year
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs☆58Updated 2 weeks ago
- (CVPR 2025) PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction☆141Updated 11 months ago
- Doodling our way to AGI ✏️ 🖼️ 🧠☆121Updated 8 months ago
- [TMLR 2025] Efficient Reasoning Models: A Survey☆298Updated last week
- ☆64Updated 2 weeks ago
- ✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints☆79Updated 7 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆349Updated last month
- ✨✨[AAAI 2026] This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Vi…☆77Updated 9 months ago
- [NeurIPS 2024] The official implementation of ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification☆32Updated 10 months ago
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆64Updated 4 months ago
- [EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…☆104Updated last year