rhysdg / vision-at-a-clip
Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts
☆23Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for vision-at-a-clip
- Deploy RT-EDTR with onnx from paddlepaddle framwork and graph cut☆27Updated last year
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…☆23Updated last year
- ☆108Updated last year
- Zero-label image classification via OpenCLIP knowledge distillation☆113Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆33Updated last month
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally☆14Updated 4 months ago
- ☆160Updated 3 months ago
- Grounding DINO module for use with Autodistill.☆18Updated 4 months ago
- Accelerate segment anything model inference using Tensorrt 8.6.1.6☆82Updated last year
- EfficientViT is a new family of vision models for efficient high-resolution vision.☆22Updated last year
- Tensorrt codebase to inference in c++ for all major neural arch using onnx☆20Updated 2 months ago
- [ICCV2023] TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance☆66Updated 3 months ago
- YOLO-World + EfficientViT SAM☆75Updated 8 months ago
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Updated last year
- EdgeSAM model for use with Autodistill.☆25Updated 5 months ago
- ☆50Updated 2 years ago
- AutoTrackAnything is a universal, flexible and interactive tool for insane automatic object tracking over thousands of frames. It is deve…☆73Updated 7 months ago
- try to export sam2 to onnx.☆19Updated last month
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.☆90Updated 3 months ago
- ☆29Updated last month
- Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT Text detection (Pytorch), included converter from Pytorch -> O…☆31Updated 3 years ago
- DeepStream Libraries offer CVCUDA, NvImageCodec, and PyNvVideoCodec modules as Python APIs for seamless integration into custom framewor…☆25Updated 3 weeks ago
- ☆25Updated last year
- NVIDIA DeepStream SDK 6.3 / 6.2 / 6.1.1 / 6.1 / 6.0.1 / 6.0 application for YOLO-Face models☆54Updated last year
- Python scripts for the Segment Anythin 2 (SAM2) model in ONNX☆167Updated 2 months ago
- This repository provides optical character detection and recognition solution optimized on Nvidia devices.☆53Updated 3 weeks ago
- Official repository of the first-ranking solution for the UPAR2024 Challenge - Track 1.☆21Updated 10 months ago
- HunyuanDiT with TensorRT and libtorch☆15Updated 5 months ago
- yolov8 model with SAM meta☆121Updated 11 months ago