chengtao-lv / PTQ4SAM
[CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything
☆67Updated 9 months ago
Alternatives and similar repositories for PTQ4SAM:
Users that are interested in PTQ4SAM are comparing it to the libraries listed below
- [ECCV 2024] Isomorphic Pruning for Vision Models☆66Updated 8 months ago
- [CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric☆53Updated 2 years ago
- The official implementation of the NeurIPS 2022 paper Q-ViT.☆87Updated last year
- [CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer☆67Updated 11 months ago
- ☆33Updated last year
- One summary of efficient segment anything models☆95Updated 8 months ago
- PyTorch implementation of PTQ4DiT https://arxiv.org/abs/2405.16005☆27Updated 5 months ago
- Post-Training Quantization for Vision transformers.☆214Updated 2 years ago
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…☆43Updated 6 months ago
- The official implementation of the AAAI 2024 paper Bi-ViT.☆10Updated last year
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory☆67Updated last year
- This is the official pytorch implementation for the paper: Towards Accurate Post-training Quantization for Diffusion Models.(CVPR24 Poste…☆34Updated 10 months ago
- List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.☆82Updated 10 months ago
- [CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Mo…☆62Updated 8 months ago
- [ECCV 2024] AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer☆23Updated 4 months ago
- [ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar…☆55Updated last year
- [NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.☆70Updated 3 months ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆35Updated last year
- This is a repository of Binary General Matrix Multiply (BGEMM) by customized CUDA kernel. Thank FP6-LLM for the wheels!☆14Updated 7 months ago
- ☆34Updated 2 years ago
- The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…☆119Updated last year
- [ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation☆74Updated 3 weeks ago
- The official implementation of the ICML 2023 paper OFQ-ViT☆30Updated last year
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆47Updated last year
- ☆22Updated last year
- 1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio…☆220Updated 7 months ago
- ALGM applied to Segmenter☆24Updated 10 months ago
- ☆19Updated last year
- This is the official pytorch implementation for the paper: *Quantformer: Learning Extremely Low-precision Vision Transformers*.☆23Updated 2 years ago
- [ICCV 23]An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging tech…☆94Updated last year