roboflow / rf100-vl
Code from the paper "Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models"
☆108 Updated 2 months ago
Alternatives and similar repositories for rf100-vl
Users who are interested in rf100-vl are comparing it to the libraries listed below
- Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models ☆85 Updated 7 months ago
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark ☆172 Updated 2 months ago
- Code for replicating Roboflow 100 benchmark results and programmatically downloading benchmark datasets ☆284 Updated last year
- Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting. ☆287 Updated 5 months ago
- (CVPR 2024) Point, Segment and Count: A Generalized Framework for Object Counting ☆120 Updated last year
- ☆194 Updated 6 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard. ☆93 Updated last week
- Continuation of an abandoned project fast-coco-eval ☆135 Updated last month
- Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts ☆43 Updated last year
- Official implementation of the WACV 2025 (Oral) paper. RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positiv… ☆296 Updated 8 months ago
- An SDK for Transformers + YOLO and other SSD family models ☆65 Updated 10 months ago
- This method uses Segment Anything and CLIP to ground and count any object that matches a custom text prompt, without requiring any point … ☆176 Updated 2 years ago
- Estimate dataset difficulty and detect label mistakes using reconstruction error ratios! ☆27 Updated 11 months ago
- Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection ☆93 Updated 9 months ago
- Zero-label image classification via OpenCLIP knowledge distillation ☆139 Updated 2 years ago
- (CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of La… ☆511 Updated 3 months ago
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024 ☆86 Updated last year
- Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-lan… ☆126 Updated last year
- Object tracking pipelines complete with RF-DETR, YOLOv9, YOLO-NAS, YOLOv8, and YOLOv7 detection and BYTETracker tracking ☆82 Updated 6 months ago
- DEIMKit is a Python package that provides a wrapper for DEIM: DETR with Improved Matching for Fast Convergence. Check out the original re… ☆102 Updated 8 months ago
- ☆56 Updated 2 years ago
- Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning ☆130 Updated 5 months ago
- [ICCV 2023] ReST: A Reconfigurable Spatial-Temporal Graph Model for Multi-Camera Multi-Object Tracking ☆163 Updated last year
- Focusing on Tracks for Online Multi-Object Tracking ☆81 Updated 2 months ago
- ☆152 Updated 3 years ago
- This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection". ☆438 Updated 9 months ago
- Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding ☆209 Updated 2 months ago
- Includes the VideoCount dataset and CountVid code for the paper Open-World Object Counting in Videos. ☆78 Updated last month
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally ☆19 Updated last year
- ☆95 Updated 7 months ago