andyrull / width-and-Depth-pruning-for-Vision-Transformer
☆16Updated 2 years ago
Related projects: ⓘ
- [NeurIPS'21] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang…☆90Updated 9 months ago
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…☆40Updated last year
- ☆9Updated 11 months ago
- [ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li…☆45Updated 9 months ago
- ☆39Updated 3 weeks ago
- ☆40Updated last year
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer☆29Updated 9 months ago
- Data-Free Neural Architecture Search via Recursive Label Calibration. ECCV 2022.☆32Updated 2 years ago
- The official implementation of the NeurIPS 2022 paper Q-ViT.☆77Updated last year
- [ICLR'23] Trainability Preserving Neural Pruning (PyTorch)☆29Updated last year
- S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)☆63Updated 3 years ago
- Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer☆69Updated 2 years ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆33Updated 6 months ago
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆69Updated 2 years ago
- ☆23Updated last year
- ☆23Updated 2 years ago
- Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)☆68Updated 2 years ago
- [NeurIPS 2022] “M³ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design”, Hanxue …☆90Updated last year
- ☆16Updated 2 years ago
- The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC…☆19Updated last year
- torch_quantizer is a out-of-box quantization tool for PyTorch models on CUDA backend, specially optimized for Diffusion Models.☆17Updated 5 months ago
- Fire Together Wire Together: A Dynamic Pruning Approach with Self-Supervised Mask Prediction☆9Updated 2 years ago
- Official PyTorch implementation of "Evolving Search Space for Neural Architecture Search"☆10Updated 3 years ago
- ☆20Updated last year
- [ICCV-2023] EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantization☆27Updated 9 months ago
- [ICCV 23]An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging tech…☆85Updated last year
- To appear in the 11th International Conference on Learning Representations (ICLR 2023).☆16Updated last year
- This is the pytorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is…☆24Updated 3 years ago
- Official Pytorch implementation of Super Vision Transformer (IJCV)☆42Updated last year
- BESA is a differentiable weight pruning technique for large language models.☆12Updated 6 months ago