microsoft / onnxconverter-common

Common utilities for ONNX converters

☆276

Alternatives and similar repositories for onnxconverter-common

Users who are interested in onnxconverter-common are comparing it to the libraries listed below.

onnx / optimizer
ONNX Optimizer
☆737 Updated this week
microsoft / onnxscript
ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.
☆369 Updated this week
microsoft / onnxruntime-extensions
onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime
☆404 Updated last week
ThanatosShinji / onnx-tool
A parser, editor and profiler tool for ONNX models.
☆446 Updated last month
tsingmicro-toolchain / OnnxSlim
A Toolkit to Help Optimize Large Onnx Model
☆157 Updated last year
microsoft / onnxruntime-training-examples
Examples for using ONNX Runtime for model training.
☆339 Updated 9 months ago
triton-inference-server / onnxruntime_backend
The Triton backend for the ONNX Runtime.
☆156 Updated 2 weeks ago
triton-inference-server / model_analyzer
Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…
☆482 Updated 2 weeks ago
inisis / OnnxSlim
A Toolkit to Help Optimize Onnx Model
☆188 Updated this week
Tencent / TPAT
TensorRT Plugin Autogen Tool
☆369 Updated 2 years ago
triton-inference-server / backend
Common source, scripts and utilities for creating Triton backends.
☆336 Updated 2 weeks ago
scailable / sclblonnx
Scailable ONNX python tools
☆97 Updated 9 months ago
pytorch / ort
Accelerate PyTorch models with ONNX Runtime
☆364 Updated 5 months ago
triton-inference-server / dali_backend
The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.
☆136 Updated last week
NVIDIA / sampleQAT
Inference of quantization aware trained networks using TensorRT
☆83 Updated 2 years ago
fumihwh / onnx-pytorch
A code generator from ONNX to PyTorch code
☆138 Updated 2 years ago
PINTO0309 / simple-onnx-processing-tools
A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…
☆296 Updated last year
Talmaj / onnx2pytorch
Transform ONNX model to PyTorch representation
☆338 Updated 8 months ago
onnx / neural-compressor
Model compression for ONNX
☆97 Updated 8 months ago
triton-inference-server / model_navigator
Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.
☆210 Updated 3 months ago
triton-inference-server / tensorrt_backend
The Triton backend for TensorRT.
☆77 Updated last week
onnx / onnx-mlir
Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
☆887 Updated last week
pytorch / multipy
torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…
☆180 Updated 3 weeks ago
gmalivenko / onnx-opcounter
Count number of parameters / MACs / FLOPS for ONNX models.
☆93 Updated 9 months ago
triton-inference-server / client
Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.
☆637 Updated 2 weeks ago
mlc-ai / tokenizers-cpp
Universal cross-platform tokenizers binding to HF and sentencepiece
☆367 Updated last week
NVIDIA / TensorRT-Model-Optimizer
A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. …
☆1,078 Updated 3 weeks ago
microsoft / nnfusion
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
☆991 Updated 10 months ago
pytorch / kineto
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
☆842 Updated last week
quic / aimet-model-zoo
☆334 Updated last year