rhysdg / vision-at-a-clipLinks
Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts
☆41Updated 9 months ago
Alternatives and similar repositories for vision-at-a-clip
Users that are interested in vision-at-a-clip are comparing it to the libraries listed below
Sorting:
- Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"☆40Updated last year
- ☆120Updated last year
- Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models☆74Updated last month
- Accelerate segment anything model inference using Tensorrt 8.6.1.6☆92Updated last year
- try to export sam2 to onnx.☆55Updated 3 weeks ago
- Deploy RT-EDTR with onnx from paddlepaddle framwork and graph cut☆29Updated 2 years ago
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆54Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆61Updated this week
- ☆78Updated last year
- ContourFormer:Real-Time Contour-Based End-to-End Instance Segmentation Transformer☆15Updated 3 weeks ago
- triton server ensemble model demo☆30Updated 3 years ago
- Grounding DINO module for use with Autodistill.☆20Updated 11 months ago
- MobileSAM already integrated into Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds☆40Updated last year
- ☆33Updated 11 months ago
- Torchserve + TensorRT + Detection☆19Updated 3 years ago
- Zero-label image classification via OpenCLIP knowledge distillation☆125Updated last year
- ☆28Updated 2 years ago
- DeepStream Libraries offer CVCUDA, NvImageCodec, and PyNvVideoCodec modules as Python APIs for seamless integration into custom framewor…☆51Updated 7 months ago
- ☆53Updated 3 years ago
- ☆37Updated last year
- Code from the paper "Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models"☆61Updated 3 weeks ago
- ☆77Updated last year
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆36Updated last year
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆109Updated last month
- YOLO-World + EfficientViT SAM☆98Updated last year
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally☆18Updated 10 months ago
- Tensorrt codebase to inference in c++ for all major neural arch using onnx☆35Updated 3 months ago
- This repository provides optical character detection and recognition solution optimized on Nvidia devices.☆75Updated 3 weeks ago
- Segment Anything Model 2 CPP Wrapper for macOS and Ubuntu CPU/GPU☆137Updated this week
- Python scripts for the Segment Anythin 2 (SAM2) model in ONNX☆249Updated 9 months ago