[ICLR 2026] RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning.
☆5,740 · Feb 28, 2026 · Updated this week
Alternatives and similar repositories for rf-detr
Users who are interested in rf-detr are comparing it to the libraries listed below.
- D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight] ☆3,031 · Jan 5, 2026 · Updated last month
- Trackers gives you clean, modular re-implementations of leading multi-object tracking algorithms released under the permissive Apache 2.0… ☆2,888 · Updated this week
- [CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥… ☆4,892 · Dec 3, 2025 · Updated 3 months ago
- YOLOE: Real-Time Seeing Anything [ICCV 2025] ☆2,051 · Jun 26, 2025 · Updated 8 months ago
- We write your reusable computer vision tools. 💜 ☆36,612 · Updated this week
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL ☆2,659 · Feb 23, 2026 · Updated last week
- [CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence ☆1,437 · Sep 26, 2025 · Updated 5 months ago
- [NeurIPS 2025] YOLOv12: Attention-Centric Real-Time Object Detectors ☆2,800 · Feb 18, 2026 · Updated 2 weeks ago
- [CVPR 2024] Real-Time Open-Vocabulary Object Detection ☆6,217 · Feb 26, 2025 · Updated last year
- Turn any computer or edge device into a command center for your computer vision projects. ☆2,205 · Updated this week
- [DEIMv2] Real Time Object Detection Meets DINOv3 ☆1,528 · Jan 7, 2026 · Updated last month
- Ultralytics YOLO 🚀 ☆53,788 · Updated this week
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode… ☆18,560 · Dec 25, 2024 · Updated last year
- Code from the paper "Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models" ☆124 · Jan 6, 2026 · Updated last month
- Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory" ☆7,043 · Mar 18, 2025 · Updated 11 months ago
- Images to inference with no labeling (use foundation models to train supervised models). ☆2,634 · May 14, 2025 · Updated 9 months ago
- Framework agnostic sliced/tiled inference + interactive ui + error analysis plots ☆5,137 · Feb 12, 2026 · Updated 2 weeks ago
- Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information ☆9,481 · Aug 9, 2024 · Updated last year
- Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS. ☆5,007 · Feb 24, 2026 · Updated last week
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model" ☆877 · Jan 27, 2026 · Updated last month
- Official implementation of the WACV 2025 (Oral) paper. RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positiv… ☆317 · Mar 18, 2025 · Updated 11 months ago
- A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures l… ☆9,218 · Updated this week
- An MIT License of YOLOv9, YOLOv7, YOLO-RD ☆1,608 · Dec 30, 2025 · Updated 2 months ago
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything ☆1,364 · May 1, 2025 · Updated 10 months ago
- Reference PyTorch implementation and models for DINOv3 ☆9,670 · Feb 17, 2026 · Updated 2 weeks ago
- YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024] ☆11,226 · Mar 14, 2025 · Updated 11 months ago
- [ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection" ☆9,760 · Aug 12, 2024 · Updated last year
- PyTorch code and models for the DINOv2 self-supervised learning method. ☆12,427 · Feb 24, 2026 · Updated last week
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. ☆18,386 · Jan 30, 2026 · Updated last month
- Effortless data labeling with AI support from Segment Anything and other awesome models. ☆8,229 · Feb 21, 2026 · Updated last week
- tiny vision language model ☆9,364 · Nov 14, 2025 · Updated 3 months ago
- This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection". ☆470 · Feb 18, 2025 · Updated last year
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding ☆1,340 · Jul 23, 2025 · Updated 7 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆8,089 · Feb 10, 2025 · Updated last year
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆19,360 · Feb 24, 2026 · Updated last week
- Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2 ☆3,292 · Nov 11, 2025 · Updated 3 months ago
- [ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy ☆2,637 · Oct 15, 2025 · Updated 4 months ago
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi… ☆53,497 · Sep 18, 2024 · Updated last year
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM. ☆53,029 · Updated this week