THU-MIG/yoloe

YOLOE: Real-Time Seeing Anything [ICCV 2025]

☆2,200

Alternatives and similar repositories for yoloe

Users who are interested in yoloe are comparing it to the repositories listed below.

AILab-CVC / YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
☆6,454 · Feb 26, 2025 · Updated last year
sunsmarterjie / yolov12
[NeurIPS 2025] YOLOv12: Attention-Centric Real-Time Object Detectors
☆2,921 · May 14, 2026 · Updated last month
IDEA-Research / DINO-X-API
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
☆1,395 · Jul 23, 2025 · Updated 11 months ago
THU-MIG / YOLO-UniOW
YOLO-UniOW: Efficient Universal Open-World Object Detection
☆187 · Jan 17, 2025 · Updated last year
Peterande / D-FINE
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
☆3,213 · Updated this week
roboflow / rf-detr
RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning…
☆8,457 · Updated this week
Intellindust-AI-Lab / DEIM
[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence
☆1,569 · Mar 24, 2026 · Updated 3 months ago
lyuwenyu / RT-DETR
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥…
☆5,358 · Jun 15, 2026 · Updated 3 weeks ago
THU-MIG / yolov10
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
☆11,318 · Mar 14, 2025 · Updated last year
IDEA-Research / GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
☆10,377 · Aug 12, 2024 · Updated last year
IDEA-Research / T-Rex
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
☆2,682 · Oct 15, 2025 · Updated 8 months ago
IDEA-Research / RexSeek
[ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark
☆184 · Oct 15, 2025 · Updated 8 months ago
Atten4Vis / LW-DETR
This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection".
☆504 · Feb 18, 2025 · Updated last year
IDEA-Research / Grounding-DINO-1.5-API
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
☆1,136 · Jan 21, 2025 · Updated last year
facebookresearch / sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…
☆19,501 · May 30, 2026 · Updated last month
CVHub520 / X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
☆9,675 · Jul 5, 2026 · Updated last week
wanghao9610 / OV-DINO
OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
☆407 · Mar 12, 2025 · Updated last year
siyuanliii / masa
Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything
☆1,376 · May 1, 2025 · Updated last year
WongKinYiu / yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
☆9,535 · Aug 9, 2024 · Updated last year
Westlake-AGI-Lab / Distill-Any-Depth
The repo for "Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator"
☆692 · Apr 21, 2025 · Updated last year
IDEA-Research / Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
☆3,622 · Nov 11, 2025 · Updated 8 months ago
ultralytics / ultralytics
Ultralytics YOLO26, YOLO11, YOLOv8 — object detection, instance segmentation, semantic segmentation, image classification, pose estimatio…
☆59,125 · Updated this week
facebookresearch / perception_models
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
☆2,311 · Apr 13, 2026 · Updated 2 months ago
IDEA-Research / Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and …
☆17,656 · Sep 5, 2024 · Updated last year
facebookresearch / dinov3
Reference PyTorch implementation and models for DINOv3
☆10,885 · Jun 15, 2026 · Updated 3 weeks ago
ChaoningZhang / MobileSAM
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
☆5,804 · May 5, 2026 · Updated 2 months ago
343gltysprk / ovow
☆38 · Nov 25, 2025 · Updated 7 months ago
iSEE-Laboratory / LLMDet
(CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of La…
☆603 · Feb 4, 2026 · Updated 5 months ago
autodistill / autodistill
Images to inference with no labeling (use foundation models to train supervised models).
☆2,739 · May 14, 2025 · Updated last year
yangchris11 / samurai
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
☆7,092 · Mar 18, 2025 · Updated last year
apple / ml-mobileclip
This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025
☆1,578 · Apr 15, 2026 · Updated 2 months ago
chongzhou96 / EdgeSAM
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
☆1,160 · May 24, 2025 · Updated last year
facebookresearch / dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
☆13,073 · Jun 3, 2026 · Updated last month
CASIA-LMC-Lab / FastSAM
Fast Segment Anything
☆8,368 · Jul 30, 2024 · Updated last year
THU-MIG / RepViT
RepViT: Revisiting Mobile CNN From ViT Perspective [CVPR 2024] and RepViT-SAM: Towards Real-Time Segmenting Anything
☆1,103 · Jun 14, 2024 · Updated 2 years ago
obss / sahi
Framework agnostic sliced/tiled inference + interactive ui + error analysis plots
☆5,394 · Updated this week
Tencent / YOLO-Master
[CVPR2026]🚀🚀🚀Official code for the paper "YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detectio…
☆579 · Updated this week
IDEA-Research / DINO
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
☆2,824 · Jul 31, 2024 · Updated last year
NVlabs / describe-anything
[ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning
☆1,503 · Jun 26, 2025 · Updated last year