quic / aimet-model-zoo
☆313Updated last year
Alternatives and similar repositories for aimet-model-zoo:
Users that are interested in aimet-model-zoo are comparing it to the libraries listed below
- A parser, editor and profiler tool for ONNX models.☆418Updated last month
- Model Quantization Benchmark☆788Updated last month
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,232Updated this week
- PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.☆260Updated last year
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆366Updated this week
- ☆202Updated 3 years ago
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆344Updated 7 months ago
- Pytorch implementation of BRECQ, ICLR 2021☆266Updated 3 years ago
- Actively maintained ONNX Optimizer☆673Updated last month
- PyTorch implementation for the APoT quantization (ICLR 2020)☆272Updated 2 months ago
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…☆286Updated 10 months ago
- A simple network quantization demo using pytorch from scratch.☆521Updated last year
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆427Updated last year
- Inference of quantization aware trained networks using TensorRT☆80Updated 2 years ago
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆300Updated 5 months ago
- PyTorch Quantization Aware Training Example☆130Updated 9 months ago
- A code generator from ONNX to PyTorch code☆135Updated 2 years ago
- benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆203Updated 4 years ago
- ☆139Updated last year
- ☆222Updated 2 years ago
- Neural Network Compression Framework for enhanced OpenVINO™ inference☆978Updated this week
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆394Updated 2 years ago
- Offline Quantization Tools for Deploy.☆123Updated last year
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework☆276Updated last year
- Quantization of Convolutional Neural networks.☆243Updated 6 months ago
- ☆224Updated 3 years ago
- ☆126Updated 3 months ago
- On-the-fly Structured Pruning for PyTorch models. This library implements several attributions metrics and structured pruning utils for n…☆164Updated 4 years ago
- [CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision☆379Updated 4 years ago
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer☆325Updated last year