Code from the paper "Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models"
☆124Jan 6, 2026Updated last month
Alternatives and similar repositories for rf100-vl
Users that are interested in rf100-vl are comparing it to the libraries listed below
Sorting:
- [AAAI 2026] SimROD: A Simple Baseline for Raw Object Detection with Global and Local Enhancements☆30Nov 8, 2025Updated 3 months ago
- Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)"☆10May 15, 2024Updated last year
- [ICLR 2026] RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for…☆5,740Updated this week
- DEIMKit is a Python package that provides a wrapper for DEIM: DETR with Improved Matching for Fast Convergence. Check out the original re…☆112Apr 10, 2025Updated 10 months ago
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆33Aug 11, 2022Updated 3 years ago
- Download flickr8k, flickr30k image caption datasets☆42Feb 6, 2024Updated 2 years ago
- Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in Pytorch☆17Dec 22, 2022Updated 3 years ago
- ☆15Mar 1, 2022Updated 4 years ago
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆79May 10, 2025Updated 9 months ago
- Tennis Detection and Visualization System An advanced computer vision system for tennis match analysis that tracks players and ball move…☆25Jan 30, 2026Updated last month
- PyTorch implementation of STR models for transfer learning in Indic Languages☆16Sep 20, 2021Updated 4 years ago
- The repository provides code for running inference with the Meta Segment Anything Model 3 (SAM 3).☆56Feb 18, 2026Updated 2 weeks ago
- YOLOE: Real-Time Seeing Anything [ICCV 2025]☆2,051Jun 26, 2025Updated 8 months ago
- [DEIMv2] Real Time Object Detection Meets DINOv3☆1,528Jan 7, 2026Updated last month
- A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space☆98Jan 7, 2026Updated last month
- Code for replicating Roboflow 100 benchmark results and programmatically downloading benchmark datasets☆287Oct 26, 2024Updated last year
- ☆27Feb 20, 2024Updated 2 years ago
- Quantification of Uncertainty with Adversarial Models☆29Jul 11, 2023Updated 2 years ago
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆2,177Feb 11, 2026Updated 2 weeks ago
- This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection".☆470Feb 18, 2025Updated last year
- Curso de procesamiento de imágenes con Python☆12Feb 26, 2020Updated 6 years ago
- RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)☆364Aug 31, 2024Updated last year
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)☆130Nov 5, 2025Updated 3 months ago
- Video descriptions of research papers relating to foundation models and scaling☆29Mar 16, 2023Updated 2 years ago
- ☆32Jul 23, 2022Updated 3 years ago
- Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models☆88May 1, 2025Updated 10 months ago
- DETRPose: Real-time end-to-end transformer model for multi-person pose estimation☆72Dec 21, 2025Updated 2 months ago
- SaccadeNet : mimic how human locate accurate bounding box☆29Jul 10, 2019Updated 6 years ago
- Code release for paper "You Only Segment Once: Towards Real-Time Panoptic Segmentation" [CVPR 2023]☆286Jul 21, 2023Updated 2 years ago
- PyTorch implementation of "UNIT: Unifying Image and Text Recognition in One Vision Encoder", NeurlPS 2024.☆34Sep 26, 2024Updated last year
- We propose IAD-R1, a universal post-training framework that enhances Vision-Language Models for industrial anomaly detection through a tw…☆68Dec 9, 2025Updated 2 months ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Sep 12, 2023Updated 2 years ago
- Official implementation of CVPR2024 Paper "Poly Kernel Inception Network for Remote Sensing Detection".☆77Aug 5, 2024Updated last year
- [CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale☆213Feb 27, 2024Updated 2 years ago
- Firma electrónica del SRI de Ecuador en python☆11Apr 26, 2022Updated 3 years ago
- Flutter mobile app for recording student attendance☆12Jun 20, 2023Updated 2 years ago
- A complete pipeline for fine-tuning YOLOv8 pose models with custom datasets. Supports automatic and semi-automatic annotation for efficie…☆15Feb 9, 2025Updated last year
- Official Karate Stars App written in Flutter for Android and iOS☆11Jan 19, 2023Updated 3 years ago
- Official repo of Griffon series including v1(ECCV 2024), v2(ICCV 2025), G, and R, and also the RL tool Vision-R1.☆248Aug 12, 2025Updated 6 months ago