dusty-nv / NanoDBLinks
Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP
☆59Updated 2 months ago
Alternatives and similar repositories for NanoDB
Users that are interested in NanoDB are comparing it to the libraries listed below
Sorting:
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!☆51Updated last week
- CLIP and SigLIP models optimized with TensorRT with a Transformers-like API☆27Updated 9 months ago
- Model compression for ONNX☆96Updated 7 months ago
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆55Updated last year
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆48Updated 10 months ago
- JAX bindings for the flash-attention3 kernels☆11Updated 11 months ago
- Inference Llama 2 in C++☆43Updated last year
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆64Updated 11 months ago
- This repository provides optical character detection and recognition solution optimized on Nvidia devices.☆76Updated 2 months ago
- ☆99Updated 10 months ago
- ☆27Updated 2 weeks ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆292Updated 8 months ago
- Python scripts performing object detection using the YOLOv9 MIT model in ONNX.☆32Updated 10 months ago
- A collection of reference AI microservices and workflows for Jetson Platform Services☆42Updated 5 months ago
- A tool convert TensorRT engine/plan to a fake onnx☆40Updated 2 years ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 4 months ago
- Simple tool to change the INPUT and OUTPUT shape of ONNX.☆15Updated 3 months ago
- 💡💡💡awesome compute vision app in gradio☆53Updated last year
- Converts CLIP models to ONNX☆11Updated 2 years ago
- Inference TinyLlama models on ncnn☆24Updated last year
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆18Updated 10 months ago
- EdgeSAM model for use with Autodistill.☆27Updated last year
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆86Updated 9 months ago
- ☆111Updated 3 weeks ago
- LightNet is an optimized deep learning framework based on the popular darknet platform. It is optimized to create efficient and high-spee…☆37Updated last year
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.☆10Updated last year
- A reference application for a local AI assistant with LLM and RAG☆112Updated 7 months ago
- Object tracking pipelines complete with RF-DETR, YOLOv9, YOLO-NAS, YOLOv8, and YOLOv7 detection and BYTETracker tracking☆77Updated last month
- Lightweight Llama 3 8B Inference Engine in CUDA C☆47Updated 3 months ago
- An ONNX-based implementation of the CLIP model that doesn't depend on torch or torchvision.☆72Updated last year