quic / aimet-model-zooLinks

☆335

Alternatives and similar repositories for aimet-model-zoo

Users that are interested in aimet-model-zoo are comparing it to the libraries listed below

Sorting:

ThanatosShinji / onnx-tool
A parser, editor and profiler tool for ONNX models.
☆458Updated 2 months ago
alibaba / TinyNeuralNetwork
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
☆850Updated last month
NVIDIA / sampleQAT
Inference of quantization aware trained networks using TensorRT
☆83Updated 2 years ago
SonySemiconductorSolutions / mct-model-optimization
Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…
☆418Updated last week
DeadAt0m / LSQFakeQuantize-PyTorch
FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch
☆36Updated 3 years ago
leimao / PyTorch-Quantization-Aware-Training
PyTorch Quantization Aware Training Example
☆141Updated last year
Talmaj / onnx2pytorch
Transform ONNX model to PyTorch representation
☆340Updated 11 months ago
Qualcomm-AI-research / transformer-quantization
☆205Updated 3 years ago
ModelTC / MQBench
Model Quantization Benchmark
☆842Updated 5 months ago
jakc4103 / DFQ
PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.
☆262Updated 2 years ago
quic / qidk
☆165Updated 3 months ago
Jermmy / pytorch-quantization-demo
A simple network quantization demo using pytorch from scratch.
☆538Updated 2 years ago
microsoft / nn-Meter
A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.
☆359Updated last year
Zhen-Dong / HAWQ
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
☆450Updated 2 years ago
fumihwh / onnx-pytorch
A code generator from ONNX to PyTorch code
☆141Updated 2 years ago
gmalivenko / onnx-opcounter
Count number of parameters / MACs / FLOPS for ONNX models.
☆94Updated 11 months ago
yhhhli / BRECQ
Pytorch implementation of BRECQ, ICLR 2021
☆284Updated 4 years ago
quic / aimet
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
☆2,471Updated this week
ModelTC / Dipoorlet
Offline Quantization Tools for Deploy.
☆139Updated last year
PINTO0309 / simple-onnx-processing-tools
A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…
☆297Updated last year
tianyic / only_train_once_personal_footprint
OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM
☆309Updated last year
yhhhli / APoT_Quantization
PyTorch implementation for the APoT quantization (ICLR 2020)
☆277Updated 10 months ago
megvii-research / FQ-ViT
[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer
☆350Updated 2 years ago
amirgholami / ZeroQ
[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework
☆280Updated last year
AI-performance / embedded-ai.bench
benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.
☆204Updated 4 years ago
lisosia / tflite-flops
Roughly calculate FLOPs of a tflite model
☆39Updated 4 years ago
microsoft / onnxconverter-common
Common utilities for ONNX converters
☆282Updated last month
Qualcomm-AI-research / FP8-quantization
☆162Updated 2 years ago
ENOT-AutoDL / onnx2torch
Convert ONNX models to PyTorch.
☆704Updated this week
openvinotoolkit / nncf
Neural Network Compression Framework for enhanced OpenVINO™ inference
☆1,087Updated this week