[CVPR 2025] APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers
☆44Apr 7, 2025Updated last year
Alternatives and similar repositories for APHQ-ViT
Users that are interested in APHQ-ViT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ECCV 2024] AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer☆43Dec 9, 2024Updated last year
- ☆19Feb 4, 2025Updated last year
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆29Jun 16, 2025Updated last year
- Official PyTorch implementation of QwT—“Quantization without Tears” (CVPR 2025): fast, accurate, and hassle-free post-training network qu…☆32Sep 30, 2025Updated 9 months ago
- Structured Binary Neural Networks for Image Recognition☆16Oct 12, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…☆131Sep 23, 2025Updated 9 months ago
- The official implementation of the ICML 2023 paper OFQ-ViT☆39Oct 3, 2023Updated 2 years ago
- Structured Binary Neural Networks for Image Recognition☆18Nov 18, 2021Updated 4 years ago
- [ECCV 2024] SparseRefine: Sparse Refinement for Efficient High-Resolution Semantic Segmentation☆16Jan 10, 2025Updated last year
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆39Mar 11, 2024Updated 2 years ago
- [ASP-DAC 2025] "NeuronQuant: Accurate and Efficient Post-Training Quantization for Spiking Neural Networks" Official Implementation☆19Mar 6, 2025Updated last year
- nn2FPGA converts ONNX models into FPGA dataflow accelerators with seamless ONNX Runtime integration.☆21Jun 25, 2026Updated last week
- [ICCV 2023] RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers☆144Jan 10, 2024Updated 2 years ago
- Open-source of MSD framework☆16Sep 12, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.☆106Jun 2, 2024Updated 2 years ago
- [NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.☆103Jan 3, 2025Updated last year
- This repository contains low-bit quantization papers from 2020 to 2026 on top conference.☆174Jun 25, 2026Updated last week
- [ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar…☆56Mar 4, 2024Updated 2 years ago
- ☆82Jul 21, 2022Updated 3 years ago
- Official repository of SpikeZIP-TF in ICML2024☆51Dec 4, 2024Updated last year
- Code implementation of GPTAQ (https://arxiv.org/abs/2504.02692)☆92Jul 28, 2025Updated 11 months ago
- Implementation of Input Stationary, Weight Stationary and Output Stationary dataflow for given neural network on a tiled architecture☆10Apr 19, 2020Updated 6 years ago
- Official repo of LookWhere (NeurIPS 2025) for efficient high-res visual recognition☆16Oct 23, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- c++ version of ViT☆12Nov 13, 2022Updated 3 years ago
- ☆12Jun 4, 2024Updated 2 years ago
- Code for "End-to-End Adaptive Sampling and Representation for Event-based Detection with Recurrent Spiking Neural Networks", ECCV 2024☆39Oct 25, 2024Updated last year
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆27Dec 3, 2021Updated 4 years ago
- ☆14Jun 22, 2022Updated 4 years ago
- [CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything☆86Jun 26, 2024Updated 2 years ago
- Python implementation of "MAPS: Multiresolution Adaptive Parameterization of Surfaces"☆12Oct 24, 2021Updated 4 years ago
- A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including languag…☆206Feb 10, 2025Updated last year
- ☆12May 5, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- BinaryViT: Pushing Binary Vision Transformers Towards Convolutional Models☆39Feb 4, 2024Updated 2 years ago
- ☆12Aug 5, 2025Updated 10 months ago
- The code repository of "MBQ: Modality-Balanced Quantization for Large Vision-Language Models"☆93Mar 17, 2025Updated last year
- Official implementation of the ICLR'25 paper "QERA: an Analytical Framework for Quantization Error Reconstruction".☆14Feb 4, 2025Updated last year
- Revisit Kernel Pruning with Lottery Regulated Grouped Convolutions. ICLR 2022☆11Nov 24, 2022Updated 3 years ago
- PyTorch implementation of PTQ4DiT https://arxiv.org/abs/2405.16005☆49Nov 8, 2024Updated last year
- Implement spike-drive using OR residual connection and propose SynA attention for natural pruning.(Under Review)☆13Mar 31, 2024Updated 2 years ago