roboflow / rf100-vl
Code from the paper "Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models"
☆100 Updated last month
Alternatives and similar repositories for rf100-vl
Users that are interested in rf100-vl are comparing it to the libraries listed below
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark ☆172 Updated last month
- Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models ☆84 Updated 6 months ago
- Code for replicating Roboflow 100 benchmark results and programmatically downloading benchmark datasets ☆283 Updated last year
- Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting. ☆284 Updated 4 months ago
- (CVPR 2024) Point, Segment and Count: A Generalized Framework for Object Counting ☆118 Updated last year
- Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts ☆43 Updated last year
- Official implementation of the WACV 2025 (Oral) paper. RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positiv… ☆286 Updated 8 months ago
- Continuation of an abandoned project fast-coco-eval ☆129 Updated last month
- Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-lan… ☆116 Updated last year
- ☆194 Updated 5 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard. ☆92 Updated last week
- ☆150 Updated 3 years ago
- DEIMKit is a Python package that provides a wrapper for DEIM: DETR with Improved Matching for Fast Convergence. Check out the original re… ☆99 Updated 7 months ago
- ☆78 Updated 7 months ago
- D-FINE: SoTA Object Detection model custom training/exporting/inferencing pipeline from scratch ☆72 Updated this week
- Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection ☆93 Updated 8 months ago
- An SDK for Transformers + YOLO and other SSD family models ☆64 Updated 9 months ago
- This method uses Segment Anything and CLIP to ground and count any object that matches a custom text prompt, without requiring any point … ☆174 Updated 2 years ago
- Focusing on Tracks for Online Multi-Object Tracking ☆80 Updated last month
- This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection". ☆434 Updated 9 months ago
- ☆56 Updated 2 years ago
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally ☆19 Updated last year
- Zero-label image classification via OpenCLIP knowledge distillation ☆137 Updated 2 years ago
- [ICCV 2023] ReST: A Reconfigurable Spatial-Temporal Graph Model for Multi-Camera Multi-Object Tracking ☆161 Updated last year
- Implementation of paper - DEYO: DETR with YOLO for End-to-End Object Detection ☆97 Updated last year
- [ICCV2023] TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance ☆116 Updated last year
- Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning ☆128 Updated 4 months ago
- A tutorial introducing knowledge distillation as an optimization technique for deployment on NVIDIA Jetson ☆221 Updated 2 years ago
- (CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of La… ☆491 Updated 3 months ago
- Combining "segment-anything" with MOT, it creates the era of "MOTS" ☆155 Updated 2 years ago