NVIDIA / tao_deploy
Package for deploying deep learning models from TAO Toolkit
☆19Updated 6 months ago
Alternatives and similar repositories for tao_deploy:
Users that are interested in tao_deploy are comparing it to the libraries listed below
- TAO Toolkit deep learning networks with PyTorch backend☆91Updated 4 months ago
- Quick start scripts and tutorial notebooks to get started with TAO Toolkit☆70Updated 6 months ago
- TAO Toolkit deep learning networks with TensorFlow 1.x backend☆13Updated last year
- ☆93Updated 5 months ago
- A collection of reference AI microservices and workflows for Jetson Platform Services☆35Updated last month
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP☆48Updated 9 months ago
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.☆189Updated 9 months ago
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!☆48Updated this week
- Deploying LLMs offline on the NVIDIA Jetson platform marks the dawn of a new era in embodied intelligence, where devices can function ind…☆89Updated 11 months ago
- ☆32Updated last year
- ☆66Updated 2 years ago
- A tool convert TensorRT engine/plan to a fake onnx☆38Updated 2 years ago
- DeltaCNN End-to-End CNN Inference of Sparse Frame Differences in Videos☆60Updated 2 years ago
- This repository describes how to add a custom TensorRT plugin in c++ and python☆28Updated 3 years ago
- Deploy RT-EDTR with onnx from paddlepaddle framwork and graph cut☆29Updated last year
- ☆31Updated 8 months ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆101Updated 2 weeks ago
- A tutorial introducing knowledge distillation as an optimization technique for deployment on NVIDIA Jetson☆179Updated last year
- CLIP and SigLIP models optimized with TensorRT with a Transformers-like API☆22Updated 5 months ago
- ☆61Updated 4 months ago
- Sample app code for deploying TAO Toolkit trained models to Triton☆86Updated 6 months ago
- ☆171Updated last year
- A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson☆321Updated 2 years ago
- CUda Matrix Multiply library.☆74Updated last week
- Standalone Flash Attention v2 kernel without libtorch dependency☆105Updated 6 months ago
- Edge AI Model Development Tools☆44Updated this week
- ☆158Updated last year
- TAO best practices. How to adapt for a new domain, new classes, and generalize the model with a small dataset using Nvidia's TAO toolkit☆24Updated 2 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Updated last year