ThanatosShinji / onnx-tool
A parser, editor and profiler tool for ONNX models.
☆425 · Updated 3 months ago
Alternatives and similar repositories for onnx-tool:
Users who are interested in onnx-tool are comparing it to the libraries listed below.
- ONNX Optimizer ☆693 · Updated 3 weeks ago
- Model Quantization Benchmark ☆799 · Updated this week
- A tool to modify ONNX models visually, based on Netron and Flask. ☆1,470 · Updated last month
- Offline quantization tools for deployment. ☆127 · Updated last year
- A toolkit to help optimize large ONNX models ☆153 · Updated 11 months ago
- Common utilities for ONNX converters ☆266 · Updated 4 months ago
- TensorRT Plugin Autogen Tool ☆369 · Updated 2 years ago
- A simple tool that can generate TensorRT plugin code quickly. ☆230 · Updated last year
- Inference of quantization aware trained networks using TensorRT ☆80 · Updated 2 years ago
- A simple network quantization demo using PyTorch from scratch. ☆526 · Updated last year
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications. ☆193 · Updated 10 months ago
- A sample of onnxparser working with TensorRT user-defined plugins for TRT 7.0 ☆167 · Updated 4 years ago
- ONNX2Pytorch ☆161 · Updated 4 years ago
- ppl.cv is a high-performance image processing library of openPPL supporting various platforms. ☆501 · Updated 5 months ago
- PyTorch Neural Network eXchange ☆574 · Updated last week
- row-major matmul optimization ☆624 · Updated last year
- ☆319 · Updated last year
- A PyTorch-to-TensorRT converter with dynamic shape support ☆260 · Updated last year
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op… ☆290 · Updated last year
- VeriSilicon Tensor Interface Module ☆234 · Updated 3 months ago
- Benchmark for embedded-AI deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc. ☆204 · Updated 4 years ago
- PyTorch Quantization Aware Training Example ☆135 · Updated 11 months ago
- Quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM. ☆431 · Updated last year
- nvidia-modelopt is a unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculat… ☆870 · Updated this week
- Count number of parameters / MACs / FLOPS for ONNX models. ☆91 · Updated 5 months ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio… ☆397 · Updated 2 years ago
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework. ☆823 · Updated last month
- YOLOv5 on Orin DLA ☆197 · Updated last year
- ⚡ Useful scripts when using TensorRT ☆242 · Updated 4 years ago
- ☆275 · Updated 2 years ago