avbelova / POT_tutorialLinks
OpenVINO Post-Training Optimization Toolkit Tutorial
☆16Updated 5 years ago
Alternatives and similar repositories for POT_tutorial
Users that are interested in POT_tutorial are comparing it to the libraries listed below
Sorting:
- Count number of parameters / MACs / FLOPS for ONNX models.☆95Updated last year
- Inference of quantization aware trained networks using TensorRT☆83Updated 3 years ago
- A code generator from ONNX to PyTorch code☆142Updated 3 years ago
- Parallel CUDA implementation of NON maximum Suppression☆81Updated 5 years ago
- ONNX converter and optimizer scirpts for Kneron hardware.☆40Updated 2 years ago
- ☆79Updated 4 years ago
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch☆37Updated 4 years ago
- quantize aware training package for NCNN on pytorch☆69Updated 4 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆10Updated 4 years ago
- Benchmark inference speed of CNNs with various quantization methods in Pytorch+TensorRT with Jetson Nano/Xavier☆56Updated 2 years ago
- MegEngine到其他框架的转换器☆70Updated 2 years ago
- ☆25Updated 3 years ago
- Scailable ONNX python tools☆98Updated last year
- Deep Learning Inference benchmark. Supports OpenVINO™ toolkit, TensorFlow, TensorFlow Lite, ONNX Runtime, OpenCV DNN, MXNet, PyTorch, Apa…☆35Updated this week
- Tencent NCNN with added CUDA support☆71Updated 5 years ago
- PyTorch Quantization Aware Training Example☆150Updated last year
- Android demo for dabnn☆20Updated 6 years ago
- ☆17Updated 5 years ago
- Model compression for ONNX☆98Updated last year
- ☆68Updated 2 years ago
- RidgeRun Inference Framework☆27Updated 3 years ago
- Parse TFLite models (*.tflite) EASILY with Python. Check the API at https://zhenhuaw.me/tflite/docs/☆104Updated last year
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆73Updated 3 years ago
- edge/mobile transformer based Vision DNN inference benchmark☆16Updated 5 months ago
- Using ideas from product quantization for state-of-the-art neural network compression.☆146Updated 4 years ago
- Additions and patches to Caffe framework for use with Synopsys DesignWare EV Family of Processors☆23Updated 3 months ago
- Jetson embedded platform-target deep learning inference acceleration framework with TensorRT☆30Updated 3 months ago
- Benchmark scripts for TVM☆74Updated 3 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Updated 2 years ago
- ☆52Updated 5 years ago