roboflow / rf100-vlLinks

Code from the paper "Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models"

☆72

Alternatives and similar repositories for rf100-vl

Users that are interested in rf100-vl are comparing it to the libraries listed below

Sorting:

WongKinYiu / GeneralistYOLO
Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models
☆77Updated 3 months ago
clxia12 / RT-DETRv3
Official implementation of the WACV 2025 ( Oral ) paper. RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positiv…
☆216Updated 4 months ago
ChungYi347 / Interactive-Multi-Class-Tiny-Object-Detection
☆149Updated 3 years ago
LilianHollard / LeYOLO
☆193Updated 2 months ago
roboflow / roboflow-100-benchmark
Code for replicating Roboflow 100 benchmark results and programmatically downloading benchmark datasets
☆271Updated 9 months ago
rhysdg / vision-at-a-clip
Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts
☆42Updated 11 months ago
IDEA-Research / RexSeek
[ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark
☆149Updated 3 months ago
roboflow / model-leaderboard
Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.
☆84Updated last week
ouyanghaodong / DEYO
Implementation of paper - DEYO: DETR with YOLO for End-to-End Object Detection
☆94Updated last year
Atten4Vis / LW-DETR
This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection".
☆379Updated 5 months ago
laclouis5 / globox
A package to read and convert object detection datasets (COCO, YOLO, PascalVOC, LabelMe, CVAT, OpenImage, ...) and evaluate them with COC…
☆207Updated 2 weeks ago
MiXaiLL76 / faster_coco_eval
Continuation of an abandoned project fast-coco-eval
☆117Updated last month
Hzzone / PseCo
(CVPR 2024) Point, Segment and Count: A Generalized Framework for Object Counting
☆115Updated 8 months ago
dnth / DEIMKit
DEIMKit is a Python package that provides a wrapper for DEIM: DETR with Improved Matching for Fast Convergence. Check out the original re…
☆68Updated 3 months ago
autodistill / autodistill-yolov8
YOLOv8 Target Model plugin for Autodistill
☆44Updated last year
ArgoHA / custom_d_fine
D-FINE: SoTA Object Detection model custom training/exporting/inferencing pipeline from scratch
☆48Updated 2 weeks ago
Xuan-World / Mamba-YOLO-World
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection
☆89Updated 4 months ago
AyushExel / trolo
An SDK for Transformers + YOLO and other SSD family models
☆63Updated 6 months ago
NVIDIA-AI-IOT / jetson-intro-to-distillation
A tutorial introducing knowledge distillation as an optimization technique for deployment on NVIDIA Jetson
☆206Updated last year
NVIDIA-AI-IOT / clip-distillation
Zero-label image classification via OpenCLIP knowledge distillation
☆134Updated last year
jahongir7174 / YOLOv8-pt
YOLOv8 implementation using PyTorch
☆163Updated last year
dnth / yolov5-deepsparse-blogpost
By the end of this post, you will learn how to: Train a SOTA YOLOv5 model on your own data. Sparsify the model using SparseML quantizati…
☆55Updated 2 years ago
akashAD98 / YOLOV8_SAM
yolov8 model with SAM meta
☆139Updated last year
voxel51 / reconstruction-error-ratios
Estimate dataset difficulty and detect label mistakes using reconstruction error ratios!
☆25Updated 6 months ago
iSEE-Laboratory / LLMDet
(CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of La…
☆337Updated last week
NVIDIAAICITYCHALLENGE / 2024AICITY_Code_From_Top_Teams
☆49Updated last year
niki-amini-naieni / CountGD
Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting.
☆271Updated last month
autodistill / autodistill-grounded-sam-2
Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.
☆126Updated last year
miquel-espinosa / no-time-to-train
Official code for "No time to train! Training-Free Reference-Based Instance Segmentation"
☆165Updated 2 weeks ago
chengche6230 / ReST
[ICCV 2023] ReST: A Reconfigurable Spatial-Temporal Graph Model for Multi-Camera Multi-Object Tracking
☆160Updated last year