[PR 2024] HTQ: Exploring the High-Dimensional Trade-Off of Mixed-Precision Quantization
☆12 · Jul 16, 2024 · Updated last year
Alternatives and similar repositories for HTQ
Users that are interested in HTQ are comparing it to the libraries listed below
- [ECCV 2022] Patch Similarity Aware Data-Free Quantization for Vision Transformers ☆123 · Dec 22, 2022 · Updated 3 years ago
- [ICCV 2023] I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference ☆200 · Sep 2, 2024 · Updated last year
- [ICCV 2023] RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers ☆140 · Jan 10, 2024 · Updated 2 years ago
- The official implementation of "EDA-DM: Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models" ☆21 · Jul 8, 2025 · Updated 7 months ago
- ☆18 · Jan 17, 2024 · Updated 2 years ago
- ☆36 · Mar 29, 2023 · Updated 2 years ago
- [CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything ☆82 · Jun 26, 2024 · Updated last year
- ☆11 · Feb 7, 2025 · Updated last year
- LaTeX template for ITMO style presentations ☆10 · Jan 19, 2025 · Updated last year
- A TensorFlow Keras implementation of Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorizatio… ☆10 · Dec 18, 2019 · Updated 6 years ago
- Persistent dense gemm for Hopper in `CuTeDSL` ☆15 · Aug 9, 2025 · Updated 6 months ago
- ☆18 · Nov 26, 2025 · Updated 3 months ago
- Workshop materials for AI Engineer World's Fair ☆14 · Jun 3, 2025 · Updated 8 months ago
- A toolkit for developers to simplify the transformation of nn.Module instances. It currently corresponds to PyTorch's torch.fx. ☆13 · Apr 7, 2023 · Updated 2 years ago
- List of papers related to neural network quantization in recent AI conferences and journals. ☆805 · Mar 27, 2025 · Updated 11 months ago
- ☆25 · Oct 11, 2025 · Updated 4 months ago
- BERT TensorRT model acceleration and deployment ☆10 · Apr 1, 2022 · Updated 3 years ago
- ECCV 2026 paper template ☆42 · Jan 23, 2026 · Updated last month
- Face detection and recognition with SphereFace and FaceBoxes ☆12 · Dec 29, 2017 · Updated 8 years ago
- Image demoireing, moire synthesis ☆16 · Apr 25, 2024 · Updated last year
- Some LaTeX Tips for Writing Research Papers ☆10 · May 30, 2016 · Updated 9 years ago
- A tool for model sparsification based on torch.fx ☆13 · Jun 3, 2024 · Updated last year
- ☆11 · Jan 10, 2025 · Updated last year
- https://github.com/ARM-software/ML-KWS-for-MCU ☆14 · Jul 8, 2018 · Updated 7 years ago
- ☆17 · Jun 13, 2022 · Updated 3 years ago
- A TensorFlow-based implementation of DeepVoice3 https://arxiv.org/abs/1710.07654 ☆13 · Jun 5, 2018 · Updated 7 years ago
- Efficient GPU kernels for mixed-precision Vision Transformers in Triton ☆18 · Sep 18, 2025 · Updated 5 months ago
- ☆12 · Jan 12, 2019 · Updated 7 years ago
- Code for reproducing the results in "Forecasting Human Dynamics from Static Images" ☆13 · Jun 16, 2024 · Updated last year
- NeurIPS 2020 paper: UnModNet: Learning to Unwrap a Modulo Image for High Dynamic Range Imaging ☆10 · Oct 24, 2021 · Updated 4 years ago
- BitSplit Post-training Quantization ☆50 · Dec 20, 2021 · Updated 4 years ago
- ☆16 · May 14, 2025 · Updated 9 months ago
- ☆13 · Oct 29, 2022 · Updated 3 years ago
- A PyTorch reimplementation of liuwei16/CSP; their trained Keras weights are loaded in PyTorch. ☆13 · Mar 27, 2020 · Updated 5 years ago
- A collection of AI resources covering machine learning, deep learning, computer vision, and related areas ☆11 · Dec 17, 2022 · Updated 3 years ago
- Super-resolution; post-training quantization; model compression ☆14 · Nov 10, 2023 · Updated 2 years ago
- The official PyTorch implementation of the ICLR 2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan… ☆128 · Sep 23, 2025 · Updated 5 months ago
- Official implementation of Generative Low-bitwidth Data Free Quantization (GDFQ) ☆55 · Jul 23, 2023 · Updated 2 years ago
- The code for Joint Neural Architecture Search and Quantization ☆14 · Apr 10, 2019 · Updated 6 years ago