mit-han-lab / sparsevit
[CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
☆74Updated last year
Alternatives and similar repositories for sparsevit
Users that are interested in sparsevit are comparing it to the libraries listed below
- [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference☆30Updated last year
- [ICCV 23] An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging tech…☆101Updated 2 years ago
- This is the official PyTorch implementation for the paper: Towards Accurate Post-training Quantization for Diffusion Models. (CVPR24 Poste…☆37Updated last year
- [ECCV 2024] Isomorphic Pruning for Vision Models☆77Updated last year
- [CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything☆80Updated last year
- [ECCV 2024] AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer☆34Updated 9 months ago
- (ICLR 2025) BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models☆24Updated last year
- [NeurIPS 2022] “M³ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design”, Hanxue …☆131Updated 2 years ago
- [ICLR 2025] Mixture Compressor for Mixture-of-Experts LLMs Gains More☆54Updated 7 months ago
- [CVPR 2023 Highlight] This is the official implementation of "Stitchable Neural Networks".☆248Updated 2 years ago
- ☆12Updated 2 years ago
- ☆35Updated 2 years ago
- Official PyTorch implementation of Which Tokens to Use? Investigating Token Reduction in Vision Transformers presented at ICCV 2023 NIVT …☆34Updated 2 years ago
- Official implementation of "SViT: Revisiting Token Pruning for Object Detection and Instance Segmentation"☆35Updated last year
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer☆31Updated last year
- FFNet: MetaMixer-based Efficient Convolutional Mixer Design☆30Updated 6 months ago
- The code repository of "MBQ: Modality-Balanced Quantization for Large Vision-Language Models"☆59Updated 6 months ago
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…☆46Updated last year
- torch_quantizer is an out-of-the-box quantization tool for PyTorch models on CUDA backend, specially optimized for Diffusion Models.☆22Updated last year
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆38Updated last year
- The official implementation of the NeurIPS 2022 paper Q-ViT.☆96Updated 2 years ago
- The codebase for paper "PPT: Token Pruning and Pooling for Efficient Vision Transformer"☆26Updated 10 months ago
- ☆47Updated 2 years ago
- Official code for ICCV 2023 paper "Convolutional Networks with Oriented 1D Kernels"☆47Updated last year
- Official repository of InLine attention (NeurIPS 2024)☆55Updated 9 months ago
- [ECCV 2024] SparseRefine: Sparse Refinement for Efficient High-Resolution Semantic Segmentation☆12Updated 8 months ago
- ImageNet-1K data download, processing for using as a dataset☆113Updated 2 years ago
- ALGM applied to Segmenter☆30Updated last year
- 1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio…☆223Updated last year
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory☆72Updated last year