NVIDIA / tao_deployLinks
Package for deploying deep learning models from TAO Toolkit
☆24Updated 2 months ago
Alternatives and similar repositories for tao_deploy
Users that are interested in tao_deploy are comparing it to the libraries listed below
Sorting:
- TAO Toolkit deep learning networks with PyTorch backend☆107Updated 2 months ago
- Quick start scripts and tutorial notebooks to get started with TAO Toolkit☆132Updated last month
- High-performance, light-weight C++ LLM and VLM Inference Software for Physical AI☆227Updated last month
- A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson☆364Updated 3 years ago
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.☆225Updated last year
- ☆107Updated 3 months ago
- YOLOv5 on Orin DLA☆221Updated last year
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆129Updated 9 months ago
- This repository provides YOLOV5 GPU optimization sample☆106Updated 3 years ago
- ☆70Updated 3 years ago
- A tutorial introducing knowledge distillation as an optimization technique for deployment on NVIDIA Jetson☆231Updated 2 years ago
- Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models☆87Updated 9 months ago
- DeepStream Libraries offer CVCUDA, NvImageCodec, and PyNvVideoCodec modules as Python APIs for seamless integration into custom framewor…☆77Updated 4 months ago
- A simple tool that can generate TensorRT plugin code quickly.☆239Updated 2 years ago
- Deep Learning tools and applications for NVIDIA AGX platforms.☆266Updated last week
- A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT.☆401Updated last year
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration☆79Updated 8 months ago
- Sample app code for deploying TAO Toolkit trained models to Triton☆89Updated last year
- Deploying LLMs offline on the NVIDIA Jetson platform marks the dawn of a new era in embodied intelligence, where devices can function ind…☆108Updated last year
- ☆33Updated 2 months ago
- NVIDIA DeepStream SDK 8.0 / 7.1 / 7.0 / 6.4 / 6.3 / 6.2 / 6.1.1 / 6.1 / 6.0.1 / 6.0 application for YOLO-Segmentation models☆95Updated 3 months ago
- Deep insight tensorrt, including but not limited to qat, ptq, plugin, triton_inference, cuda☆23Updated 3 weeks ago
- This project aims to explore the deployment of Swin-Transformer based on TensorRT, including the test results of FP16 and INT8.☆174Updated 3 years ago
- Deploy RT-EDTR with onnx from paddlepaddle framwork and graph cut☆31Updated 2 years ago
- A project demonstrating how to use nvmetamux to run multiple models in parallel.☆112Updated last year
- This repository describes how to add a custom TensorRT plugin in c++ and python☆29Updated 4 years ago
- Awesome code, projects, books, etc. related to CUDA☆30Updated this week
- A tool convert TensorRT engine/plan to a fake onnx☆42Updated 3 years ago
- Tensorrt codebase to inference in c++ for all major neural arch using onnx☆39Updated last year
- An onnx-based quantitation tool.☆71Updated 2 years ago