ModelTC / MQBench
Model Quantization Benchmark
☆788Updated last month
Alternatives and similar repositories for MQBench:
Users that are interested in MQBench are comparing it to the libraries listed below
- A simple network quantization demo using pytorch from scratch.☆521Updated last year
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆427Updated last year
- Pytorch implementation of BRECQ, ICLR 2021☆266Updated 3 years ago
- PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.☆1,638Updated 11 months ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆394Updated 2 years ago
- A parser, editor and profiler tool for ONNX models.☆418Updated last month
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer☆325Updated last year
- [CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision☆379Updated 4 years ago
- PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.☆260Updated last year
- PyTorch implementation for the APoT quantization (ICLR 2020)☆272Updated 2 months ago
- Unofficial implementation of LSQ-Net, a neural network quantization framework☆287Updated 9 months ago
- 针对pytorch模型的自动化模型结构分析和修改工具集,包含自动分析模型结构的模型压缩算法库☆246Updated last year
- Everything in Torch Fx☆341Updated 8 months ago
- Offline Quantization Tools for Deploy.☆123Updated last year
- ResRep: Lossless CNN Pruning via Decoupling Remembering and Forgetting (ICCV 2021)☆293Updated 2 years ago
- Post-Training Quantization for Vision transformers.☆206Updated 2 years ago
- ☆222Updated 2 years ago
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework☆276Updated last year
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.☆787Updated 2 months ago
- Quantization of Convolutional Neural networks.☆243Updated 6 months ago
- Neural Network Quantization & Low-Bit Fixed Point Training For Hardware-Friendly Algorithm Design☆160Updated 4 years ago
- [ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices☆439Updated last year
- Summary, Code for Deep Neural Network Quantization☆544Updated 4 months ago
- A nnie quantization aware training tool on pytorch.☆239Updated 4 years ago
- micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…☆2,229Updated 3 years ago
- ONNX2Pytorch☆160Updated 3 years ago
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆344Updated 7 months ago
- Collection of recent methods on (deep) neural network compression and acceleration.☆936Updated 3 months ago
- [CVPR 2020] This project is the PyTorch implementation of our accepted CVPR 2020 paper : forward and backward information retention for a…☆179Updated 4 years ago
- ☆224Updated 3 years ago