rhysdg / vision-at-a-clip
Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts
☆38Updated 7 months ago
Alternatives and similar repositories for vision-at-a-clip:
Users that are interested in vision-at-a-clip are comparing it to the libraries listed below
- ☆116Updated last year
- Deploy RT-EDTR with onnx from paddlepaddle framwork and graph cut☆29Updated last year
- Accelerate segment anything model inference using Tensorrt 8.6.1.6☆89Updated last year
- [ICCV2023] TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance☆90Updated 9 months ago
- Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"☆36Updated last year
- Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models☆67Updated last month
- Grounding DINO module for use with Autodistill.☆20Updated 9 months ago
- triton server ensemble model demo☆30Updated 2 years ago
- ☆77Updated last year
- try to export sam2 to onnx.☆47Updated 6 months ago
- Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection☆78Updated last month
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆51Updated last year
- ☆35Updated last year
- Zero-label image classification via OpenCLIP knowledge distillation☆125Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆61Updated this week
- Decode JPEG image on GPU using PyTorch☆90Updated last year
- ContourFormer:Real-Time Contour-Based End-to-End Instance Segmentation Transformer☆8Updated last month
- This repository provides optical character detection and recognition solution optimized on Nvidia devices.☆74Updated last week
- DeepStream Libraries offer CVCUDA, NvImageCodec, and PyNvVideoCodec modules as Python APIs for seamless integration into custom framewor…☆49Updated 6 months ago
- Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding☆182Updated 2 months ago
- SAM-CLIP module for use with Autodistill.☆15Updated last year
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU)☆216Updated last year
- ☆53Updated 3 years ago
- Fine-tuning Qwen2.5-VL for vision-language tasks | Optimized for Vision understanding | LoRA & PEFT support.☆51Updated 2 months ago
- Add MobileSAM support for Inpaint anything using Segment Anything and inpainting models.☆50Updated last year
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…☆23Updated 2 years ago
- YOLO-World + EfficientViT SAM☆92Updated last year
- tensorrt yolov7 without onnxparser☆24Updated 2 years ago
- MobileSAM already integrated into Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds☆38Updated last year
- Instance and panoptic segmentation using yolov9 in onnx☆11Updated last year