An official implement of CVPR 2023 paper - NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers
☆26Mar 13, 2024Updated 2 years ago
Alternatives and similar repositories for NoisyQuant
Users that are interested in NoisyQuant are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official implementation of the ICML 2023 paper OFQ-ViT☆39Oct 3, 2023Updated 2 years ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆14Apr 29, 2025Updated last year
- Post-Training Quantization for Vision transformers.☆242Jul 19, 2022Updated 3 years ago
- BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.☆58Feb 7, 2023Updated 3 years ago
- Pytorch implementation of our paper accepted by CVPR 2022 -- IntraQ: Learning Synthetic Images with Intra-Class Heterogeneity for Zero-Sh…☆37Mar 2, 2022Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [FPGA'21] CoDeNet is an efficient object detection model on PyTorch, with SOTA performance on VOC and COCO based on CenterNet and Co-Desi…☆28Feb 7, 2023Updated 3 years ago
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…☆49Sep 27, 2024Updated last year
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer☆361Apr 11, 2023Updated 3 years ago
- BinaryViT: Pushing Binary Vision Transformers Towards Convolutional Models☆39Feb 4, 2024Updated 2 years ago
- AFPQ code implementation☆23Nov 6, 2023Updated 2 years ago
- Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.☆16Nov 1, 2021Updated 4 years ago
- (ICCV 2023) Official implementation of Rectified Straight Through Estimator (ReSTE).☆34Sep 20, 2024Updated last year
- Algorithm-hardware Co-design for Deformable Convolution☆24Jan 14, 2021Updated 5 years ago
- Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"☆61Mar 19, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [CVPR 2022] Learnable Lookup Table for Neural Network Quantization☆44Oct 6, 2022Updated 3 years ago
- List of papers related to neural network quantization in recent AI conferences and journals.☆822Mar 27, 2025Updated last year
- Official implementation of EMNLP'23 paper "Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?"☆24Oct 25, 2023Updated 2 years ago
- ☆25Nov 22, 2024Updated last year
- ☆14Oct 6, 2023Updated 2 years ago
- Reproducing Raghu et al 2021 - Do Vision Transformers See Like Convolutional Neural Networks?☆17Aug 30, 2021Updated 4 years ago
- Benchmark tests supporting the TiledCUDA library.☆19Nov 19, 2024Updated last year
- [ICCV 2023] I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference☆204Sep 2, 2024Updated last year
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆51Oct 21, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- HWFI: Hybrid Warping Fusion for Video Frame Interpolation. IJCV 2022☆11Sep 7, 2022Updated 3 years ago
- RIFE with IFUNet, FusionNet and RefineNet☆12Jun 30, 2022Updated 3 years ago
- ☆19Nov 11, 2024Updated last year
- ☆11Jan 12, 2023Updated 3 years ago
- ☆19Mar 21, 2023Updated 3 years ago
- ☆19Mar 17, 2021Updated 5 years ago
- 在FPGA上实现SRIO收发控制器☆11Sep 30, 2022Updated 3 years ago
- PB-LLM: Partially Binarized Large Language Models☆156Nov 20, 2023Updated 2 years ago
- Official code for Cumulative Spatial Knowledge Distillation for Vision Transformers (ICCV-2023) https://openaccess.thecvf.com/content/ICC…☆15Nov 5, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Training with Block Minifloat number representation☆18May 2, 2021Updated 5 years ago
- A PyTorch Framework for Efficient Pruning and Quantization for specialized accelerators.☆37Dec 5, 2021Updated 4 years ago
- ☆30May 24, 2020Updated 6 years ago
- SQuant [ICLR22]☆131Sep 27, 2022Updated 3 years ago
- This repository contains integer operators on GPUs for PyTorch.☆236Sep 29, 2023Updated 2 years ago
- ☆37Oct 21, 2025Updated 7 months ago
- Paper list for accleration of transformers☆14Jul 1, 2023Updated 2 years ago