ModelTC / Dipoorlet
Offline Quantization Tools for Deploy.
☆140Updated last year
Alternatives and similar repositories for Dipoorlet
Users that are interested in Dipoorlet are comparing it to the libraries listed below
- ONNX2Pytorch☆164Updated 4 years ago
- A set of examples around MegEngine☆31Updated last year
- TensorRT 2022 second-round solution: TensorRT inference optimization of MST++, the first Transformer-based image reconstruction model☆143Updated 3 years ago
- An NNIE quantization-aware training tool for PyTorch.☆238Updated 4 years ago
- Based on the paper "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"☆64Updated 4 years ago
- ☆100Updated 4 years ago
- This project aims to explore the deployment of Swin-Transformer based on TensorRT, including the test results of FP16 and INT8.☆171Updated 2 years ago
- ☆81Updated 4 years ago
- Useful TensorRT plugins for PyTorch and MMDetection model conversion.☆165Updated last year
- Basic quantization methods including QAT, PTQ, per_channel, per_tensor, dorefa, lsq, adaround, omse, Histogram, bias_correction, etc.☆50Updated 2 years ago
- A parser, editor and profiler tool for ONNX models.☆460Updated 2 months ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆405Updated 2 years ago
- A sample for onnxparser working with trt user defined plugins for TRT7.0☆169Updated 5 years ago
- A simple tool that can generate TensorRT plugin code quickly.☆236Updated 2 years ago
- This is an 8-bit quantization sample for yolov5. PTQ, QAT and Partial Quantization have all been implemented, and present the results based…☆109Updated 3 years ago
- Model Quantization Benchmark☆843Updated 6 months ago
- Inference of quantization aware trained networks using TensorRT☆83Updated 2 years ago
- PyTorch Quantization Aware Training Example☆143Updated last year
- Converter from MegEngine to other frameworks☆70Updated 2 years ago
- Everything in Torch Fx☆345Updated last year
- ☆44Updated 4 years ago
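- An automated toolkit for analyzing and modifying the structure of PyTorch models, including a model compression algorithm library that analyzes model structure automatically☆254Updated 2 years ago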
- A Toolkit to Help Optimize Large ONNX Models☆161Updated last year
- A simple network quantization demo using pytorch from scratch.☆538Updated 2 years ago
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch☆36Updated 3 years ago
- ☆43Updated 3 years ago
- arm-neon☆92Updated last year
- NART = NART is not A RunTime, a deep learning inference framework.☆37Updated 2 years ago
- An Improved One millisecond Mobile Backbone☆147Updated 3 years ago
- ☆36Updated 2 years ago