gmalivenko / onnx-opcounter
Count number of parameters / MACs / FLOPS for ONNX models.
☆84Updated 2 years ago
Related projects: ⓘ
- Inference of quantization aware trained networks using TensorRT☆77Updated last year
- PyTorch Quantization Aware Training Example☆119Updated 4 months ago
- Scailable ONNX python tools☆96Updated this week
- MegEngine到其他框架的转换器☆67Updated last year
- Offline Quantization Tools for Deploy.☆109Updated 8 months ago
- ☆52Updated 3 years ago
- A Toolkit to Help Optimize Large Onnx Model☆135Updated 4 months ago
- quantize aware training package for NCNN on pytorch☆68Updated 3 years ago
- A parser, editor and profiler tool for ONNX models.☆379Updated 3 weeks ago
- A code generator from ONNX to PyTorch code☆132Updated last year
- TensorRT plugin forDCNv2 layer in ONNX model☆57Updated 3 years ago
- A sample for onnxparser working with trt user defined plugins for TRT7.0☆166Updated 3 years ago
- Useful tensorrt plugin. For pytorch and mmdetection model conversion.☆155Updated 7 months ago
- Parallel CUDA implementation of NON maximum Suppression☆77Updated 4 years ago
- Tencent NCNN with added CUDA support☆67Updated 3 years ago
- Utility scripts for editing or modifying onnx models. Utility scripts to summarize onnx model files along with visualization for loop ope…☆79Updated 3 years ago
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch☆32Updated 2 years ago
- Script to typecast ONNX model parameters from INT64 to INT32.☆92Updated 4 months ago
- A package to make do Network Slimming a little easier☆47Updated 2 years ago
- A pytorch to tensorrt convert with dynamic shape support☆255Updated 7 months ago
- ☆78Updated 3 years ago
- benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆202Updated 3 years ago
- resize image in (CUDA, python, cupy)☆36Updated last year
- symmetric int8 gemm☆66Updated 4 years ago
- Benchmark inference speed of CNNs with various quantization methods in Pytorch+TensorRT with Jetson Nano/Xavier☆54Updated last year
- PyTorch Static Quantization Example☆39Updated 3 years ago
- ☆95Updated 3 years ago
- The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.☆122Updated last week
- Serving Inside Pytorch☆141Updated last week
- Common utilities for ONNX converters☆245Updated 2 months ago