onnx / optimizer
Actively maintained ONNX Optimizer
☆647 Updated 8 months ago
Related projects
Alternatives and complementary repositories for optimizer
- A parser, editor and profiler tool for ONNX models. ☆400 Updated this week
- Common utilities for ONNX converters ☆251 Updated 5 months ago
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime ☆338 Updated this week
- TensorRT Plugin Autogen Tool ☆367 Updated last year
- A tool to modify ONNX models in a visualization fashion, based on Netron and Flask. ☆1,346 Updated 2 weeks ago
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python. ☆286 Updated this week
- Simplify your onnx model ☆3,865 Updated 2 months ago
- Deploy your model with TensorRT quickly. ☆762 Updated 11 months ago
- Transform ONNX model to PyTorch representation ☆318 Updated last week
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv… ☆433 Updated last week
- Accelerate PyTorch models with ONNX Runtime ☆356 Updated 2 months ago
- PyTorch Neural Network eXchange ☆526 Updated this week
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description. ☆962 Updated 2 months ago
- TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillati… ☆567 Updated this week
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT ☆2,597 Updated this week
- ppl.cv is a high-performance image processing library of openPPL supporting various platforms. ☆493 Updated 3 weeks ago
- Common source, scripts and utilities for creating Triton backends. ☆295 Updated this week
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op… ☆277 Updated 6 months ago
- Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala. ☆570 Updated this week
- TensorFlow/TensorRT integration ☆736 Updated 11 months ago
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure ☆767 Updated this week
- Model Quantization Benchmark ☆765 Updated 5 months ago
- Neural Network Compression Framework for enhanced OpenVINO™ inference ☆943 Updated this week
- A code generator from ONNX to PyTorch code ☆133 Updated 2 years ago
- The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API. ☆125 Updated 2 weeks ago
- A PyTorch to TensorRT converter with dynamic shape support ☆257 Updated 9 months ago
- The Tensor Algebra SuperOptimizer for Deep Learning ☆692 Updated last year
- Convert ONNX models to PyTorch. ☆620 Updated 3 months ago
- Inference of quantization aware trained networks using TensorRT ☆79 Updated last year
- ONNX-TensorRT: TensorRT backend for ONNX ☆2,953 Updated 2 weeks ago