quic / aimet-model-zoo
☆302Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for aimet-model-zoo
- A parser, editor and profiler tool for ONNX models.☆400Updated this week
- ☆195Updated 3 years ago
- PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.☆258Updated last year
- Model Quantization Benchmark☆765Updated 5 months ago
- PyTorch implementation for the APoT quantization (ICLR 2020)☆268Updated 2 years ago
- A simple network quantization demo using pytorch from scratch.☆511Updated last year
- Pytorch implementation of BRECQ, ICLR 2021☆253Updated 3 years ago
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆328Updated this week
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆413Updated last year
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆336Updated 3 months ago
- Inference of quantization aware trained networks using TensorRT☆79Updated last year
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,148Updated this week
- ☆214Updated 2 years ago
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆293Updated 2 months ago
- ☆122Updated last year
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework☆274Updated 11 months ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization☆95Updated 2 years ago
- PyTorch Quantization Aware Training Example☆122Updated 6 months ago
- Offline Quantization Tools for Deploy.☆116Updated 10 months ago
- Quantization of Convolutional Neural networks.☆238Updated 3 months ago
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.☆754Updated 3 weeks ago
- A code generator from ONNX to PyTorch code☆133Updated 2 years ago
- Unofficial implementation of LSQ-Net, a neural network quantization framework☆277Updated 6 months ago
- Actively maintained ONNX Optimizer☆647Updated 8 months ago
- A library for researching neural networks compression and acceleration methods.☆136Updated 2 months ago
- Post-Training Quantization for Vision transformers.☆190Updated 2 years ago
- ☆104Updated last month
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer☆308Updated last year
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆95Updated 3 years ago
- ResRep: Lossless CNN Pruning via Decoupling Remembering and Forgetting (ICCV 2021)☆288Updated last year