NVIDIA / tao_deploy
Package for deploying deep learning models from TAO Toolkit
☆20Updated 10 months ago
Alternatives and similar repositories for tao_deploy
Users that are interested in tao_deploy are comparing it to the libraries listed below
- Quick start scripts and tutorial notebooks to get started with TAO Toolkit☆94Updated 10 months ago
- TAO Toolkit deep learning networks with PyTorch backend☆95Updated 8 months ago
- ☆99Updated 10 months ago
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.☆204Updated last year
- TAO Toolkit deep learning networks with TensorFlow 1.x backend☆13Updated last year
- A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson☆334Updated 3 years ago
- A tutorial introducing knowledge distillation as an optimization technique for deployment on NVIDIA Jetson☆201Updated last year
- YOLOv5 on Orin DLA☆205Updated last year
- A collection of reference AI microservices and workflows for Jetson Platform Services☆42Updated 5 months ago
- A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space☆87Updated 6 months ago
- ☆32Updated last year
- Deploying LLMs offline on the NVIDIA Jetson platform marks the dawn of a new era in embodied intelligence, where devices can function ind…☆98Updated last year
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆115Updated 2 months ago
- A reference application for a local AI assistant with LLM and RAG☆112Updated 7 months ago
- Using Unified Memory on Jetson☆27Updated 3 years ago
- This repository provides a YOLOv5 GPU optimization sample☆106Updated 2 years ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆292Updated 8 months ago
- A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT.☆345Updated 5 months ago
- Awesome code, projects, books, etc. related to CUDA☆19Updated this week
- Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models☆76Updated 2 months ago
- Deploy RT-DETR with ONNX from the PaddlePaddle framework and graph cut☆29Updated 2 years ago
- Edge AI Model Development Tools☆66Updated last week
- ☆61Updated last year
- Profile PyTorch models for FLOPs and parameters, helping to evaluate computational efficiency and memory usage.☆47Updated 2 months ago
- ☆66Updated 2 years ago
- Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption☆103Updated last year
- ☆71Updated 8 months ago
- This repository describes how to add a custom TensorRT plugin in C++ and Python☆28Updated 4 years ago
- TensorRT codebase for C++ inference across all major neural architectures using ONNX☆35Updated 5 months ago
- DeepStream Libraries offer CVCUDA, NvImageCodec, and PyNvVideoCodec modules as Python APIs for seamless integration into custom framewor…☆59Updated 8 months ago