NVIDIA / tao_deploy
Package for deploying deep learning models from TAO Toolkit
☆20, updated 3 weeks ago
Alternatives and similar repositories for tao_deploy
Users who are interested in tao_deploy are comparing it to the libraries listed below.
- TAO Toolkit deep learning networks with PyTorch backend. ☆98, updated 2 weeks ago
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications. ☆209, updated last year
- ☆68, updated 2 years ago
- Quick start scripts and tutorial notebooks to get started with TAO Toolkit. ☆100, updated 3 weeks ago
- ☆99, updated 10 months ago
- A tutorial introducing knowledge distillation as an optimization technique for deployment on NVIDIA Jetson. ☆206, updated last year
- A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson. ☆338, updated 3 years ago
- CLIP and SigLIP models optimized with TensorRT with a Transformers-like API. ☆30, updated 10 months ago
- Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models. ☆77, updated 3 months ago
- YOLOv5 on Orin DLA. ☆207, updated last year
- Zero-label image classification via OpenCLIP knowledge distillation. ☆134, updated last year
- This repository provides YOLOV5 GPU optimization sample. ☆106, updated 2 years ago
- Deploying LLMs offline on the NVIDIA Jetson platform marks the dawn of a new era in embodied intelligence, where devices can function ind… ☆98, updated last year
- Tensorrt codebase to inference in c++ for all major neural arch using onnx. ☆35, updated 5 months ago
- A reference application for a local AI assistant with LLM and RAG. ☆114, updated 8 months ago
- This repository describes how to add a custom TensorRT plugin in c++ and python. ☆28, updated 4 years ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms. ☆118, updated 3 months ago
- A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT. ☆354, updated 6 months ago
- A tool convert TensorRT engine/plan to a fake onnx. ☆41, updated 2 years ago
- Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications. ☆81, updated this week
- A simple tool that can generate TensorRT plugin code quickly. ☆231, updated 2 years ago
- Datasets, Transforms and Models specific to Computer Vision. ☆87, updated last year
- This project aims to explore the deployment of Swin-Transformer based on TensorRT, including the test results of FP16 and INT8. ☆168, updated 2 years ago
- A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space. ☆87, updated 6 months ago
- ☆122, updated 2 years ago
- Awesome code, projects, books, etc. related to CUDA. ☆21, updated 3 weeks ago
- A collection of reference AI microservices and workflows for Jetson Platform Services. ☆44, updated 6 months ago
- ☆61, updated last year
- A unified evaluation library for multiple machine learning libraries. ☆266, updated last year
- Deep Learning tools and applications for NVIDIA AGX platforms. ☆242, updated last month