qbxlvnf11 / convert-pytorch-onnx-tensorrt
Converting weights of PyTorch models to ONNX & TensorRT engines
☆50Updated 2 years ago
Alternatives and similar repositories for convert-pytorch-onnx-tensorrt
Users who are interested in convert-pytorch-onnx-tensorrt are comparing it to the libraries listed below:
- Script to typecast ONNX model parameters from INT64 to INT32.☆107Updated last year
- ONNX Runtime Inference C++ Example☆241Updated 4 months ago
- This repository provides a YOLOv5 GPU optimization sample☆106Updated 2 years ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆118Updated 3 months ago
- ☆193Updated 2 months ago
- ☆99Updated 10 months ago
- A Toolkit to Help Optimize ONNX Models☆188Updated last week
- TensorRT installation and Conversion from PyTorch Models☆33Updated 4 years ago
- Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models☆77Updated 3 months ago
- MagFace Triton Inference Server Using TensorRT☆17Updated 3 years ago
- ☆122Updated 2 years ago
- Simple example of FastAPI + Celery + Triton for benchmarking☆64Updated 2 years ago
- The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.☆136Updated last week
- This repository serves as an example of deploying the YOLO models on Triton Server for performance and testing purposes☆65Updated last year
- DeepStream Libraries offer CVCUDA, NvImageCodec, and PyNvVideoCodec modules as Python APIs for seamless integration into custom framewor…☆62Updated 9 months ago
- This repository provides an optical character detection and recognition solution optimized for NVIDIA devices.☆76Updated 2 months ago
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…☆296Updated last year
- TensorRT Examples (TensorRT, Jetson Nano, Python, C++)☆96Updated last year
- ☆36Updated 2 years ago
- ☆31Updated 3 years ago
- nvjpeg for python☆103Updated 2 years ago
- The Triton backend for TensorRT.☆77Updated last week
- NVIDIA DeepStream SDK 6.3 / 6.2 / 6.1.1 / 6.1 / 6.0.1 / 6.0 application for YOLO-Face models☆71Updated last year
- Simple example of FastAPI + gRPC AsyncIO + Triton☆68Updated 2 years ago
- ☆79Updated last year
- NVIDIA DeepStream SDK 6.3 / 6.2 / 6.1.1 / 6.1 / 6.0.1 / 6.0 implementation for YOLO-Segmentation models☆65Updated last year
- Conversion of PyTorch Models into TFLite☆389Updated 2 years ago
- Implementation of End-to-End YOLO Models for DeepStream☆58Updated 9 months ago
- Examples of model inference with ONNX Runtime and CUDA☆23Updated 2 years ago
- Implementation of the paper "You Only Learn One Representation: Unified Network for Multiple Tasks" (https://arxiv.org/abs/2105.04206)☆17Updated 3 years ago