A tutorial introducing knowledge distillation as an optimization technique for deployment on NVIDIA Jetson
☆242 · Nov 14, 2023 · Updated 2 years ago
Alternatives and similar repositories for jetson-intro-to-distillation
Users that are interested in jetson-intro-to-distillation are comparing it to the libraries listed below.
- Zero-label image classification via OpenCLIP knowledge distillation ☆144 · Sep 12, 2023 · Updated 2 years ago
- A distilled Segment Anything (SAM) model capable of running in real time with NVIDIA TensorRT ☆883 · Nov 20, 2023 · Updated 2 years ago
- A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT. ☆433 · Feb 6, 2025 · Updated last year
- A unified and extensible pipeline for deep learning model inference with C++. Now supports yolov8, yolov9, clip, and nanosam. More models … ☆12 · Aug 3, 2025 · Updated 9 months ago
- A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson ☆371 · May 19, 2022 · Updated 4 years ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector… ☆370 · Oct 18, 2024 · Updated last year
- A reference application for a local AI assistant with LLM and RAG ☆123 · Dec 5, 2024 · Updated last year
- ☆10 · Feb 14, 2025 · Updated last year
- A utility library to help integrate Python applications with Metropolis Microservices for Jetson ☆16 · Dec 21, 2024 · Updated last year
- Machine Learning Containers for NVIDIA Jetson and JetPack-L4T ☆4,682 · May 18, 2026 · Updated last week
- A reference example for integrating NanoOwl with Metropolis Microservices for Jetson ☆29 · Jun 14, 2024 · Updated last year
- ffmpeg+cuvid+tensorrt+multicamera ☆12 · Dec 31, 2024 · Updated last year
- YOLOv5 on Orin DLA ☆225 · Feb 18, 2024 · Updated 2 years ago
- ROS 2 node for open-vocabulary object detection using NanoOWL. ☆37 · Mar 8, 2024 · Updated 2 years ago
- This repository provides an optical character detection and recognition solution optimized for NVIDIA devices. ☆88 · May 13, 2025 · Updated last year
- Tools to distill the Hiera transformer backbone to CNNs that are easier to deploy on the edge. ☆15 · Dec 4, 2024 · Updated last year
- ☆16 · Dec 20, 2021 · Updated 4 years ago
- An ONNX-based quantization tool. ☆71 · Jan 8, 2024 · Updated 2 years ago
- This is the official code for the MobileSAM project, which makes SAM lightweight for mobile applications and beyond! ☆5,765 · May 5, 2026 · Updated 3 weeks ago
- A project demonstrating how to use nvmetamux to run multiple models in parallel. ☆113 · Oct 18, 2024 · Updated last year
- A simple tool that can generate TensorRT plugin code quickly. ☆241 · Jul 11, 2023 · Updated 2 years ago
- Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM" ☆1,151 · May 24, 2025 · Updated last year
- This repository provides a YOLOv5 GPU optimization sample ☆107 · Jan 6, 2023 · Updated 3 years ago
- ☆14 · Jun 11, 2024 · Updated last year
- AI kiosk with a camera and a projector to visualize the waste type of cafeteria objects ☆35 · Aug 15, 2018 · Updated 7 years ago
- ☆13 · Aug 19, 2024 · Updated last year
- TensorRT In Docker ☆11 · Dec 7, 2024 · Updated last year
- Sample apps to demonstrate how to deploy models trained with TAO on DeepStream ☆451 · Mar 18, 2026 · Updated 2 months ago
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT ☆106 · Oct 15, 2024 · Updated last year
- A project demonstrating Lidar-related AI solutions, including three GPU accelerated Lidar/camera DL networks (PointPillars, CenterPoint, … ☆1,808 · Mar 10, 2026 · Updated 2 months ago
- A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresse… ☆2,750 · Updated this week
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration ☆86 · May 26, 2025 · Updated last year
- A collection of reference AI microservices and workflows for Jetson Platform Services ☆56 · Jan 28, 2025 · Updated last year
- pruning vision models in torch ☆17 · Dec 5, 2025 · Updated 5 months ago
- Reproduction of MobileSAM using PyTorch ☆21 · Oct 27, 2023 · Updated 2 years ago
- NVIDIA-accelerated, deep-learned freespace segmentation ☆43 · Dec 11, 2025 · Updated 5 months ago
- NVIDIA TensorRT deployment of Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. ☆27 · May 14, 2024 · Updated 2 years ago
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications. ☆234 · Jun 10, 2024 · Updated last year
- VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and clou… ☆3,801 · Mar 12, 2026 · Updated 2 months ago