AILab-CVC/YOLO-World

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AILab-CVC/YOLO-World)

AILab-CVC / YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

☆6,478

Alternatives and similar repositories for YOLO-World

Users that are interested in YOLO-World are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

IDEA-Research / GroundingDINO
View on GitHub
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
☆10,441Aug 12, 2024Updated last year
WongKinYiu / yolov9
View on GitHub
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
☆9,541Aug 9, 2024Updated last year
IDEA-Research / Grounded-Segment-Anything
View on GitHub
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and …
☆17,685Sep 5, 2024Updated last year
THU-MIG / yoloe
View on GitHub
YOLOE: Real-Time Seeing Anything [ICCV 2025]
☆2,215Jun 26, 2025Updated last year
IDEA-Research / T-Rex
View on GitHub
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
☆2,690Oct 15, 2025Updated 9 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
facebookresearch / sam2
View on GitHub
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…
☆19,590May 30, 2026Updated last month
yformer / EfficientSAM
View on GitHub
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
☆2,486Dec 24, 2024Updated last year
IDEA-Research / Grounding-DINO-1.5-API
View on GitHub
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
☆1,139Jan 21, 2025Updated last year
lyuwenyu / RT-DETR
View on GitHub
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥…
☆5,400Jun 15, 2026Updated last month
THU-MIG / yolov10
View on GitHub
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
☆11,332Mar 14, 2025Updated last year
LiheYoung / Depth-Anything
View on GitHub
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
☆8,168Jul 17, 2024Updated 2 years ago
ultralytics / ultralytics
View on GitHub
Ultralytics YOLO26, YOLO11, YOLOv8 — object detection, instance segmentation, semantic segmentation, image classification, pose estimatio…
☆59,834Updated this week
facebookresearch / dinov2
View on GitHub
PyTorch code and models for the DINOv2 self-supervised learning method.
☆13,157Jun 3, 2026Updated last month
haotian-liu / LLaVA
View on GitHub
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
☆24,942Aug 12, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
facebookresearch / segment-anything
View on GitHub
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…
☆54,597Sep 18, 2024Updated last year
microsoft / GLIP
View on GitHub
Grounded Language-Image Pre-training
☆2,605Jan 24, 2024Updated 2 years ago
ChaoningZhang / MobileSAM
View on GitHub
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
☆5,826May 5, 2026Updated 2 months ago
CASIA-LMC-Lab / FastSAM
View on GitHub
Fast Segment Anything
☆8,384Jul 30, 2024Updated last year
FoundationVision / GLEE
View on GitHub
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
☆1,172Oct 21, 2024Updated last year
autodistill / autodistill
View on GitHub
Images to inference with no labeling (use foundation models to train supervised models).
☆2,746May 14, 2025Updated last year
xinyu1205 / recognize-anything
View on GitHub
Open-source and strong foundation image recognition models.
☆3,691Feb 18, 2025Updated last year
siyuanliii / masa
View on GitHub
Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything
☆1,375May 1, 2025Updated last year
facebookresearch / dinov3
View on GitHub
Reference PyTorch implementation and models for DINOv3
☆11,004Jul 15, 2026Updated last week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
OpenGVLab / InternVL
View on GitHub
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
☆10,101Sep 22, 2025Updated 10 months ago
openai / CLIP
View on GitHub
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
☆34,063Mar 25, 2026Updated 3 months ago
wanghao9610 / OV-DINO
View on GitHub
OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
☆408Mar 12, 2025Updated last year
UX-Decoder / Segment-Everything-Everywhere-All-At-Once
View on GitHub
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
☆4,795Aug 19, 2024Updated last year
CVHub520 / X-AnyLabeling
View on GitHub
Effortless data labeling with AI support from Segment Anything and other awesome models.
☆9,871Updated this week
UX-Decoder / Semantic-SAM
View on GitHub
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
☆2,853Jul 10, 2025Updated last year
WongKinYiu / yolov7
View on GitHub
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
☆14,133Aug 19, 2024Updated last year
open-mmlab / mmdetection
View on GitHub
OpenMMLab Detection Toolbox and Benchmark
☆32,838Aug 21, 2024Updated last year
QwenLM / Qwen3-VL
View on GitHub
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
☆19,662Jan 30, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mlfoundations / open_clip
View on GitHub
An open source implementation of CLIP.
☆14,019Jul 17, 2026Updated last week
SysCV / sam-hq
View on GitHub
Segment Anything in High Quality [NeurIPS 2023]
☆4,246Sep 12, 2025Updated 10 months ago
IDEA-Research / Grounded-SAM-2
View on GitHub
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
☆3,654Nov 11, 2025Updated 8 months ago
IDEA-Research / DINO-X-API
View on GitHub
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
☆1,400Jul 23, 2025Updated last year
IDEA-Research / DINO
View on GitHub
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
☆2,826Jul 31, 2024Updated last year
QwenLM / Qwen-VL
View on GitHub
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
☆6,713Aug 7, 2024Updated last year
open-mmlab / mmyolo
View on GitHub
OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated,YOLOv5, YOLOv6, YOLOv7, YOLOv8,YOLOX, PPYOLOE, etc.
☆3,460Jul 14, 2024Updated 2 years ago