This is the official PyTorch implementation for "Sharpness-aware Quantization for Deep Neural Networks".
☆44Nov 25, 2021Updated 4 years ago
Alternatives and similar repositories for SAQ
Users who are interested in SAQ are comparing it to the libraries listed below
- [TPAMI 2024] This is the official repository for our paper: "Pruning Self-attentions into Convolutional Layers in Single Path".☆115Dec 30, 2023Updated 2 years ago
- [ICCV 2021] Official implementation of "Scalable Vision Transformers with Hierarchical Pooling"☆33Dec 30, 2021Updated 4 years ago
- PyTorch implementation of "Deep Transferring Quantization" (ECCV2020)☆18Jun 22, 2022Updated 3 years ago
- ☆18Oct 4, 2022Updated 3 years ago
- Training Quantized Neural Networks with a Full-precision Auxiliary Module☆13Jun 19, 2020Updated 5 years ago
- Post-training sparsity-aware quantization☆34Feb 26, 2023Updated 3 years ago
- Quantization in the Jagged Loss Landscape of Vision Transformers☆13Oct 22, 2023Updated 2 years ago
- This is the official PyTorch implementation for "Mesa: A Memory-saving Training Framework for Transformers".☆120Dec 12, 2021Updated 4 years ago
- Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples [NeurIPS 2021]☆34Dec 12, 2021Updated 4 years ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆39Mar 11, 2024Updated last year
- ☆22Oct 27, 2024Updated last year
- Official implementation of Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information☆11Sep 28, 2023Updated 2 years ago
- ☆21Feb 11, 2022Updated 4 years ago
- torch_quantizer is an out-of-the-box quantization tool for PyTorch models on the CUDA backend, specially optimized for Diffusion Models.☆23Mar 29, 2024Updated last year
- An official PyTorch implementation of the paper "Distance-aware Quantization", ICCV 2021.☆48Nov 1, 2024Updated last year
- ☆28Oct 21, 2020Updated 5 years ago
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆27Dec 3, 2021Updated 4 years ago
- The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…☆128Sep 23, 2025Updated 5 months ago
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"☆97Jun 19, 2022Updated 3 years ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆31Mar 12, 2024Updated last year
- ☆13Aug 9, 2022Updated 3 years ago
- A simple program scheduler for your code on different devices.☆12Aug 15, 2024Updated last year
- The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…☆49Oct 5, 2022Updated 3 years ago
- (CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers☆25Feb 26, 2025Updated last year
- Unofficial Scalable-Softmax Is Superior for Attention☆20May 30, 2025Updated 9 months ago
- The official implementation of "NAS-BNN: Neural Architecture Search for Binary Neural Networks"☆13Aug 30, 2024Updated last year
- ☆14Oct 6, 2023Updated 2 years ago
- Inference-based time-resolved whole-brain imaging☆14Apr 16, 2024Updated last year
- An open-sourced PyTorch library for developing energy efficient multiplication-less models and applications.☆14Feb 3, 2025Updated last year
- BitSplit Post-training Quantization☆50Dec 20, 2021Updated 4 years ago
- An official implementation of "Network Quantization with Element-wise Gradient Scaling" (CVPR 2021) in PyTorch.☆94Jul 14, 2023Updated 2 years ago
- Official implementation of Generative Low-bitwidth Data Free Quantization (GDFQ)☆55Jul 23, 2023Updated 2 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization☆93May 5, 2022Updated 3 years ago
- A collection of papers and resources about Data Centric Graph Machine Learning (DC-GML)☆39Sep 23, 2023Updated 2 years ago
- Structured Binary Neural Networks for Image Recognition☆18Nov 18, 2021Updated 4 years ago
- The code for Joint Neural Architecture Search and Quantization☆14Apr 10, 2019Updated 6 years ago
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆35Jul 12, 2022Updated 3 years ago
- Code for ICML 2021 submission☆35Mar 24, 2021Updated 4 years ago
- PyTorch re-implementation of Hierarchical Normalization for Robust Monocular Depth Estimation☆21Dec 8, 2022Updated 3 years ago