qbxlvnf11 / convert-pytorch-onnx-tensorrt
Converting weights of Pytorch models to ONNX & TensorRT engines
☆48Updated 2 years ago
Alternatives and similar repositories for convert-pytorch-onnx-tensorrt:
Users that are interested in convert-pytorch-onnx-tensorrt are comparing it to the libraries listed below
- Script to typecast ONNX model parameters from INT64 to INT32.☆106Updated last year
- Simple example of FastAPI + Celery + Triton for benchmarking☆64Updated 2 years ago
- ONNX Runtime Inference C++ Example☆235Updated last month
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆107Updated 2 weeks ago
- NVIDIA DeepStream SDK 6.3 / 6.2 / 6.1.1 / 6.1 / 6.0.1 / 6.0 application for YOLO-Face models☆66Updated last year
- TensortRT installation and Conversion from PyTorch Models☆32Updated 4 years ago
- This repository provides YOLOV5 GPU optimization sample☆102Updated 2 years ago
- ☆34Updated last year
- NVIDIA DeepStream SDK 6.3 / 6.2 / 6.1.1 / 6.1 / 6.0.1 / 6.0 implementation for YOLO-Segmentation models☆62Updated last year
- ☆94Updated 7 months ago
- Examples for inference models with ONNXRuntime and CUDA☆21Updated last year
- This repository serves as an example of deploying the YOLO models on Triton Server for performance and testing purposes☆61Updated 11 months ago
- Magface Triton Inferece Server Using Tensorrt☆16Updated 3 years ago
- ByteTrack implementation for person tracking using PyTorch☆32Updated 2 years ago
- TensorRT Examples (TensorRT, Jetson Nano, Python, C++)☆94Updated last year
- Zero-label image classification via OpenCLIP knowledge distillation☆125Updated last year
- ☆53Updated 3 years ago
- Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models☆68Updated last week
- This repository provides optical character detection and recognition solution optimized on Nvidia devices.☆74Updated 3 weeks ago
- This repo provides the C++ implementation of YOLO-NAS based on ONNXRuntime for performing object detection in real-time.Support float32/f…☆43Updated last year
- ☆118Updated last year
- implementation of paper - You Only Learn One Representation: Unified Network for Multiple Tasks (https://arxiv.org/abs/2105.04206)☆17Updated 3 years ago
- ☆78Updated last year
- Sample app code for deploying TAO Toolkit trained models to Triton☆87Updated 8 months ago
- The Triton backend for TensorRT.☆74Updated this week
- Simple example of FastAPI + gRPC AsyncIO + Triton☆64Updated 2 years ago
- Simple console app that implements ONNX Runtime and ResNet in C++☆49Updated 2 years ago
- Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT Text detection (Pytorch), included converter from Pytorch -> O…☆33Updated 3 years ago
- Some improvements on ArcFace model☆14Updated 2 years ago
- DeepStream Libraries offer CVCUDA, NvImageCodec, and PyNvVideoCodec modules as Python APIs for seamless integration into custom framewor…☆51Updated 6 months ago