[ICLR 2026] RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning.
☆5,928Mar 20, 2026Updated this week
Alternatives and similar repositories for rf-detr
Users that are interested in rf-detr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]☆3,060Jan 5, 2026Updated 2 months ago
- Trackers gives you clean, modular re-implementations of leading multi-object tracking algorithms released under the permissive Apache 2.0…☆3,066Mar 17, 2026Updated last week
- [CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥…☆4,967Mar 2, 2026Updated 3 weeks ago
- YOLOE: Real-Time Seeing Anything [ICCV 2025]☆2,075Jun 26, 2025Updated 8 months ago
- Code from the paper "Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models"☆127Updated this week
- [CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence☆1,458Sep 26, 2025Updated 5 months ago
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL☆2,659Mar 16, 2026Updated last week
- [NeurIPS 2025] YOLOv12: Attention-Centric Real-Time Object Detectors☆2,814Feb 18, 2026Updated last month
- We write your reusable computer vision tools. 💜☆36,750Updated this week
- [DEIMv2] Real Time Object Detection Meets DINOv3☆1,575Jan 7, 2026Updated 2 months ago
- Turn any computer or edge device into a command center for your computer vision projects.☆2,229Updated this week
- [CVPR 2024] Real-Time Open-Vocabulary Object Detection☆6,253Feb 26, 2025Updated last year
- Ultralytics YOLO 🚀☆54,781Updated this week
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆18,737Updated this week
- An MIT License of YOLOv9, YOLOv7, YOLO-RD☆1,632Mar 16, 2026Updated last week
- Images to inference with no labeling (use foundation models to train supervised models).☆2,650May 14, 2025Updated 10 months ago
- Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"☆7,051Mar 18, 2025Updated last year
- Framework agnostic sliced/tiled inference + interactive ui + error analysis plots☆5,169Mar 16, 2026Updated last week
- Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information☆9,494Aug 9, 2024Updated last year
- Official implementation of the WACV 2025 ( Oral ) paper. RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positiv…☆321Mar 18, 2025Updated last year
- Reference PyTorch implementation and models for DINOv3☆9,878Mar 11, 2026Updated last week
- Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.☆5,017Feb 24, 2026Updated last month
- A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures l…☆9,253Updated this week
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"☆893Jan 27, 2026Updated last month
- [ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"☆9,867Aug 12, 2024Updated last year
- YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]☆11,262Mar 14, 2025Updated last year
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything☆1,365May 1, 2025Updated 10 months ago
- PyTorch code and models for the DINOv2 self-supervised learning method.☆12,553Mar 12, 2026Updated last week
- Effortless data labeling with AI support from Segment Anything and other awesome models.☆8,420Mar 9, 2026Updated 2 weeks ago
- This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection".☆478Feb 18, 2025Updated last year
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆18,671Jan 30, 2026Updated last month
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding☆1,348Jul 23, 2025Updated 8 months ago
- Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2☆3,346Nov 11, 2025Updated 4 months ago
- tiny vision language model☆9,427Nov 14, 2025Updated 4 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,104Feb 10, 2025Updated last year
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…☆53,684Sep 18, 2024Updated last year
- [ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box☆6,170Jun 19, 2024Updated last year
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,477Mar 1, 2026Updated 3 weeks ago
- YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documenta…☆10,391Jun 8, 2025Updated 9 months ago