dusty-nv / NanoDB
Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP
☆49Updated 9 months ago
Alternatives and similar repositories for NanoDB:
Users that are interested in NanoDB are comparing it to the libraries listed below
- CLIP and SigLIP models optimized with TensorRT with a Transformers-like API☆22Updated 5 months ago
- A utility library to help integrate Python applications with Metropolis Microservices for Jetson☆12Updated 3 months ago
- A collection of reference AI microservices and workflows for Jetson Platform Services☆38Updated last month
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆41Updated 6 months ago
- Compare Savant and PyTorch performance☆13Updated last year
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆49Updated 11 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆62Updated 7 months ago
- Inference TinyLlama models on ncnn☆24Updated last year
- A tool convert TensorRT engine/plan to a fake onnx☆38Updated 2 years ago
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆74Updated 5 months ago
- A reference example for integrating NanoOwl with Metropolis Microservices for Jetson☆30Updated 9 months ago
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!☆50Updated this week
- Model compression for ONNX☆87Updated 4 months ago
- EdgeSAM model for use with Autodistill.☆26Updated 9 months ago
- ☆93Updated 6 months ago
- ☆101Updated this week
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆249Updated 5 months ago
- EfficientSAM + YOLO World base model for use with Autodistill.☆10Updated last year
- A reference application for a local AI assistant with LLM and RAG☆108Updated 3 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆16Updated 9 months ago
- YOLOv10 C++ TensorRT : Real-Time End-to-End Object Detection☆20Updated 7 months ago
- Inference Llama 2 in C++☆44Updated 10 months ago
- Stable Diffusion in TensorRT 8.5+☆14Updated 2 years ago
- Quick start scripts and tutorial notebooks to get started with TAO Toolkit☆74Updated 6 months ago
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆15Updated 4 months ago
- This repository provides optical character detection and recognition solution optimized on Nvidia devices.☆72Updated 2 weeks ago
- EfficientViT is a new family of vision models for efficient high-resolution vision.☆24Updated last year
- LightNet is an optimized deep learning framework based on the popular darknet platform. It is optimized to create efficient and high-spee…☆37Updated last year