yanghr / BSQView external linksLinks
BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)
☆42Jan 12, 2021Updated 5 years ago
Alternatives and similar repositories for BSQ
Users that are interested in BSQ are comparing it to the libraries listed below
Sorting:
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…☆16Jan 3, 2022Updated 4 years ago
- Neural Network Quantization With Fractional Bit-widths☆11Feb 19, 2021Updated 4 years ago
- ☆42Dec 15, 2022Updated 3 years ago
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆453May 15, 2023Updated 2 years ago
- ☆25Dec 11, 2021Updated 4 years ago
- Codes for our paper "Exploring Bit-Slice Sparsity in Deep Neural Networks for Efficient ReRAM-Based Deployment" [NeurIPS'19 EMC2 workshop]…☆10Oct 12, 2020Updated 5 years ago
- PyTorch implementation of EdMIPS: https://arxiv.org/pdf/2004.05795.pdf☆60Jul 27, 2020Updated 5 years ago
- Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.☆13Apr 6, 2021Updated 4 years ago
- The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)☆139Nov 19, 2020Updated 5 years ago
- pytorch implementation of "Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks"☆129Jan 2, 2020Updated 6 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆98Jun 10, 2021Updated 4 years ago
- AFP is a hardware-friendly quantization framework for DNNs, which is contributed by Fangxin Liu and Wenbo Zhao.☆13Nov 8, 2021Updated 4 years ago
- PyTorch implementation of Towards Efficient Training for Neural Network Quantization☆16Jan 16, 2020Updated 6 years ago
- Neural Network Evaluation Tool on Crossbar-based Accelerator with Resistive Memory☆42Oct 30, 2019Updated 6 years ago
- EQ-Net [ICCV 2023]☆30Aug 15, 2023Updated 2 years ago
- [CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision☆402Feb 26, 2021Updated 4 years ago
- ☆24Apr 20, 2024Updated last year
- Adaptive floating-point based numerical format for resilient deep learning☆14Apr 11, 2022Updated 3 years ago
- Pytorch implementation of BRECQ, ICLR 2021☆289Aug 1, 2021Updated 4 years ago
- BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.☆58Feb 7, 2023Updated 3 years ago
- MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine (accepted as full paper at FPT'23)☆21Apr 17, 2024Updated last year
- Implementation of a Quantized Transformer Model☆19Mar 20, 2019Updated 6 years ago
- Provides the code for the paper "EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerators" by Luk…☆19Oct 6, 2019Updated 6 years ago
- A bit-level sparsity-awared multiply-accumulate process element.☆18Jul 9, 2024Updated last year
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆35Jun 29, 2023Updated 2 years ago
- ☆26Dec 12, 2022Updated 3 years ago
- Unofficial implementation of LSQ-Net, a neural network quantization framework☆310May 8, 2024Updated last year
- MNSIM version 1.1. We have uploaded a high-level modeling tool and please use this version: https://github.com/Zhu-Zhenhua/MNSIM_Python☆12Dec 12, 2019Updated 6 years ago
- source code of the paper: Robust Quantization: One Model to Rule Them All☆41Mar 24, 2023Updated 2 years ago
- An official implementation of "Network Quantization with Element-wise Gradient Scaling" (CVPR 2021) in PyTorch.☆94Jul 14, 2023Updated 2 years ago
- ☆28Oct 21, 2020Updated 5 years ago
- ReActNet: Towards Precise Binary NeuralNetwork with Generalized Activation Functions. In ECCV 2020.☆263Nov 11, 2021Updated 4 years ago
- Training Quantized Neural Networks with a Full-precision Auxiliary Module☆13Jun 19, 2020Updated 5 years ago
- Benchmark framework of compute-in-memory based accelerators for deep neural network (inference engine focused)☆10Jun 1, 2021Updated 4 years ago
- PyTorch code for full quantization of DNN using BCGD☆14Jul 24, 2019Updated 6 years ago
- Self-Distribution BNN☆10Mar 8, 2022Updated 3 years ago
- ☆13Oct 26, 2023Updated 2 years ago
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework☆280Dec 8, 2023Updated 2 years ago
- Fast Emulation of Approximate DNN Accelerators in PyTorch☆29Feb 23, 2024Updated last year