qbxlvnf11 / convert-pytorch-onnx-tensorrt

Converting weights of PyTorch models to ONNX & TensorRT engines

☆50

Alternatives and similar repositories for convert-pytorch-onnx-tensorrt

Users that are interested in convert-pytorch-onnx-tensorrt are comparing it to the libraries listed below

aadhithya / onnx-typecast
Script to typecast ONNX model parameters from INT64 to INT32.
☆107 Updated last year
leimao / ONNX-Runtime-Inference
ONNX Runtime Inference C++ Example
☆241 Updated 4 months ago
NVIDIA-AI-IOT / yolov5_gpu_optimization
This repository provides YOLOV5 GPU optimization sample
☆106 Updated 2 years ago
levipereira / yolov9-qat
Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.
☆118 Updated 3 months ago
LilianHollard / LeYOLO
☆193 Updated 2 months ago
NVIDIA-AI-IOT / nvidia-tao
☆99 Updated 10 months ago
inisis / OnnxSlim
A Toolkit to Help Optimize Onnx Model
☆188 Updated last week
sithu31296 / PyTorch-ONNX-TRT
TensorRT installation and Conversion from PyTorch Models
☆33 Updated 4 years ago
WongKinYiu / GeneralistYOLO
Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models
☆77 Updated 3 months ago
tonhathuy / tensorrt-triton-magface
Magface Triton Inference Server Using TensorRT
☆17 Updated 3 years ago
ChuRuaNh0 / FastSam_Awsome_TensorRT
☆122 Updated 2 years ago
Curt-Park / mnist-fastapi-celery-triton
Simple example of FastAPI + Celery + Triton for benchmarking
☆64 Updated 2 years ago
triton-inference-server / dali_backend
The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.
☆136 Updated last week
levipereira / triton-server-yolo
This repository serves as an example of deploying the YOLO models on Triton Server for performance and testing purposes
☆65 Updated last year
NVIDIA-AI-IOT / deepstream_libraries
DeepStream Libraries offer CVCUDA, NvImageCodec, and PyNvVideoCodec modules as Python APIs for seamless integration into custom framewor…
☆62 Updated 9 months ago
NVIDIA-AI-IOT / NVIDIA-Optical-Character-Detection-and-Recognition-Solution
This repository provides optical character detection and recognition solution optimized on Nvidia devices.
☆76 Updated 2 months ago
PINTO0309 / simple-onnx-processing-tools
A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…
☆296 Updated last year
NobuoTsukamoto / tensorrt-examples
TensorRT Examples (TensorRT, Jetson Nano, Python, C++)
☆96 Updated last year
masamitsu-murase / deform_conv2d_onnx_exporter
☆36 Updated 2 years ago
YH-Wu / Triton-Inference-Server-on-Kubernetes
☆31 Updated 3 years ago
UsingNet / nvjpeg-python
nvjpeg for python
☆103 Updated 2 years ago
triton-inference-server / tensorrt_backend
The Triton backend for TensorRT.
☆77 Updated last week
marcoslucianops / DeepStream-Yolo-Face
NVIDIA DeepStream SDK 6.3 / 6.2 / 6.1.1 / 6.1 / 6.0.1 / 6.0 application for YOLO-Face models
☆71 Updated last year
Curt-Park / mnist-fastapi-aio-triton
Simple example of FastAPI + gRPC AsyncIO + Triton
☆68 Updated 2 years ago
mingj2021 / segment-anything-tensorrt
☆79 Updated last year
marcoslucianops / DeepStream-Yolo-Seg
NVIDIA DeepStream SDK 6.3 / 6.2 / 6.1.1 / 6.1 / 6.0.1 / 6.0 implementation for YOLO-Segmentation models
☆65 Updated last year
sithu31296 / PyTorch-ONNX-TFLite
Conversion of PyTorch Models into TFLite
☆389 Updated 2 years ago
levipereira / deepstream-yolo-e2e
Implementation of End-to-End YOLO Models for DeepStream
☆58 Updated 9 months ago
developer0hye / onnxruntime-cuda-cpp-example
Examples for inference models with ONNXRuntime and CUDA
☆23 Updated 2 years ago
NNDam / yolor
implementation of paper - You Only Learn One Representation: Unified Network for Multiple Tasks (https://arxiv.org/abs/2105.04206)
☆17 Updated 3 years ago