[PR 2024] HTQ: Exploring the High-Dimensional Trade-Off of Mixed-Precision Quantization
☆12Jul 16, 2024Updated last year
Alternatives and similar repositories for HTQ
Users that are interested in HTQ are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ECCV 2022] Patch Similarity Aware Data-Free Quantization for Vision Transformers☆124Dec 22, 2022Updated 3 years ago
- [ICCV 2023] RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers☆144Jan 10, 2024Updated 2 years ago
- [ICCV 2023] I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference☆207Sep 2, 2024Updated last year
- ☆18Jan 17, 2024Updated 2 years ago
- [TIP 2026] The official implementation of "EDA-DM: Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models"☆21Jul 8, 2025Updated 11 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆15Apr 7, 2026Updated 2 months ago
- [CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything☆86Jun 26, 2024Updated 2 years ago
- List of papers related to neural network quantization in recent AI conferences and journals.☆831Mar 27, 2025Updated last year
- ECCV 2026 paper template☆42Jan 23, 2026Updated 5 months ago
- ☆14Jun 21, 2026Updated last week
- Efficient GPU kernels for mixed-precision Vision Transformers in Triton☆17Sep 18, 2025Updated 9 months ago
- image demoireing, moire synthesis☆17Apr 25, 2024Updated 2 years ago
- Implementation of the paper 'Spec-VLA: Speculative Decoding for Vision-Language-Action Models with Relaxed Acceptance' (EMNLP 2025)☆31Dec 16, 2025Updated 6 months ago
- The code for Joint Neural Architecture Search and Quantization☆14Apr 10, 2019Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆21Aug 6, 2025Updated 10 months ago
- A tool for model sparse based on torch.fx☆13Jun 3, 2024Updated 2 years ago
- ☆17Jun 13, 2022Updated 4 years ago
- super-resolution; post-training quantization; model compression☆14Nov 10, 2023Updated 2 years ago
- This Repository allows to convert *.weights file of darknet format to *.pt (pytorch format) and *.onnx (ONNX format).☆25Jan 28, 2021Updated 5 years ago
- ☆20Jun 6, 2026Updated 3 weeks ago
- ☆15Mar 21, 2025Updated last year
- [CVPR'20] ZeroQ Mixed-Precision implementation (unofficial): A Novel Zero Shot Quantization Framework☆14Dec 16, 2020Updated 5 years ago
- torch_quantizer is a out-of-box quantization tool for PyTorch models on CUDA backend, specially optimized for Diffusion Models.☆25Mar 29, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆36Mar 29, 2023Updated 3 years ago
- JavaScript firmware for ESP32☆18Aug 6, 2023Updated 2 years ago
- My academic homepage☆15Jan 15, 2022Updated 4 years ago
- ☆22Nov 26, 2025Updated 7 months ago
- The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…☆131Sep 23, 2025Updated 9 months ago
- Pytorch implementation of our paper accepted by ECCV 2022-- Fine-grained Data Distribution Alignment for Post-Training Quantization☆16Sep 13, 2022Updated 3 years ago
- ☆16Oct 29, 2021Updated 4 years ago
- [ICCV 2025] QuantCache:Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation☆18Sep 26, 2025Updated 9 months ago
- This repository includes the official implementation of our paper "Grouping First, Attending Smartly: Training-Free Acceleration for Diff…☆55May 21, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Artifact repository for paper Automatic Generation of High-Performance Quantized Machine Learning Kernels☆17Oct 13, 2020Updated 5 years ago
- This is the pytorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is…☆24Aug 17, 2021Updated 4 years ago
- Official implementation, datasets and trained models of "SegNeuron: 3D Neuron Instance Segmentation in Any EM Volume with a Generalist Mo…☆23Jun 1, 2026Updated last month
- tinybig for deep function learning☆58Jun 6, 2025Updated last year
- [ACL 2025 main] The official GitHub page of "Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restorati…☆59Apr 13, 2026Updated 2 months ago
- ☆28Nov 5, 2021Updated 4 years ago
- [ICLR2026] The first W4A4KV4 quantized + 50% sparse LLMs!☆33Jan 26, 2026Updated 5 months ago