ModelTC / MQBenchLinks
Model Quantization Benchmark
☆857Updated 9 months ago
Alternatives and similar repositories for MQBench
Users that are interested in MQBench are comparing it to the libraries listed below
Sorting:
- A simple network quantization demo using pytorch from scratch.☆543Updated 2 years ago
- Pytorch implementation of BRECQ, ICLR 2021☆289Updated 4 years ago
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆453Updated 2 years ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆408Updated 3 years ago
- PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.☆1,780Updated last year
- A parser, editor and profiler tool for ONNX models.☆478Updated 3 months ago
- Offline Quantization Tools for Deploy.☆142Updated 2 years ago
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer☆358Updated 2 years ago
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.☆862Updated last month
- PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.☆263Updated 2 years ago
- Everything in Torch Fx☆345Updated last year
- Unofficial implementation of LSQ-Net, a neural network quantization framework☆310Updated last year
- [CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision☆400Updated 4 years ago
- Neural Network Quantization & Low-Bit Fixed Point Training For Hardware-Friendly Algorithm Design☆161Updated 5 years ago
- 针对pytorch模型的自动化模型结构分析和修改工具集,包含自动分析模型结构的模型压缩算法库☆255Updated 2 years ago
- PyTorch implementation for the APoT quantization (ICLR 2020)☆283Updated last year
- [ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices☆449Updated 2 years ago
- Summary, Code for Deep Neural Network Quantization☆559Updated 7 months ago
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework☆280Updated 2 years ago
- ☆244Updated 3 years ago
- ResRep: Lossless CNN Pruning via Decoupling Remembering and Forgetting (ICCV 2021)☆300Updated 3 years ago
- Post-Training Quantization for Vision transformers.☆236Updated 3 years ago
- micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…☆2,272Updated 8 months ago
- Quantization of Convolutional Neural networks.☆250Updated last year
- base quantization methods including: QAT, PTQ, per_channel, per_tensor, dorefa, lsq, adaround, omse, Histogram, bias_correction.etc☆51Updated 3 years ago
- ☆1,047Updated last year
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆364Updated last year
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆310Updated last year
- A nnie quantization aware training tool on pytorch.☆238Updated 5 years ago
- The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…☆127Updated 4 months ago