xjjxmu / UniPTS
The official code for "UniPTS: A Unified Framework for Proficient Post-Training Sparsity" | [CVPR2024]
☆9Updated 7 months ago
Alternatives and similar repositories for UniPTS
Users that are interested in UniPTS are comparing it to the libraries listed below
Sorting:
- A collection of recent token reduction (token pruning, merging, clustering, etc.) techniques for ML/AI☆43Updated 2 weeks ago
- [ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference".☆103Updated 2 weeks ago
- 📚 Collection of token reduction for model compression resources.☆53Updated 2 weeks ago
- a brief repo about paper research☆15Updated 8 months ago
- [NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.☆73Updated 4 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆188Updated this week
- [CVPR'24] Official implementation of paper "FreeKD: Knowledge Distillation via Semantic Frequency Prompt".☆43Updated last year
- [AAAI-2025] The offical code for SiTo (Similarity-based Token Pruning for Stable Diffusion Models)☆27Updated 3 months ago
- 🔥CVPR 2025 Multimodal Large Language Models Paper List☆142Updated 2 months ago
- This repository contains the official implementation of "PACT: Pruning and Clustering-Based Token Reduction for Faster Visual Language M…☆15Updated last month
- [CVPR 2025] The official implementation of "CacheQuant: Comprehensively Accelerated Diffusion Models"☆20Updated last month
- Official repository for VisionZip (CVPR 2025)☆280Updated 2 months ago
- [CVPR'25] Official implementation of paper "MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders".☆24Updated 2 months ago
- A tiny paper rating web☆36Updated last month
- MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer☆41Updated 8 months ago
- A paper list of some recent works about Token Compress for Vit and VLM☆456Updated last week
- [NeurIPS 2024] The official implementation of ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification☆21Updated last month
- An open source codebase for object detection based on Jittor☆18Updated 3 months ago
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆192Updated 3 weeks ago
- Give us minutes, we give back a faster Mamba. The official implementation of "Faster Vision Mamba is Rebuilt in Minutes via Merged Token …☆39Updated 5 months ago
- [ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…☆36Updated 5 months ago
- The code for Fine-grained HBOE | AAAI 2024 (official version and optimized version).☆16Updated last year
- Code release for VTW (AAAI 2025) Oral☆39Updated 4 months ago
- A Brief Review for Computer Architecture☆19Updated 3 weeks ago
- ☆46Updated 5 months ago
- [Arxiv 2025] Efficient Reasoning Models: A Survey☆146Updated last week
- [ECCV 2024] AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer☆27Updated 5 months ago
- 📚 Collection of awesome generation acceleration resources.☆235Updated 3 weeks ago
- This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehens…☆69Updated 2 weeks ago
- This is the official pytorch implementation for the paper: Towards Accurate Post-training Quantization for Diffusion Models.(CVPR24 Poste…☆35Updated 11 months ago