stevebottos/owl-vit-object-detection

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/stevebottos/owl-vit-object-detection)

stevebottos / owl-vit-object-detection

object detection based on owl-vit

☆68

Alternatives and similar repositories for owl-vit-object-detection

Users that are interested in owl-vit-object-detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sharad5 / OWL-ViT-Object-Detection
View on GitHub
Capstone Project: Training and Finetuning for OWL ViT for Referring Expression Task
☆12Jan 13, 2024Updated 2 years ago
lorebianchi98 / FG-OVD
View on GitHub
[CVPR 2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detec…
☆68Apr 4, 2025Updated last year
densechen / Pose-refinement
View on GitHub
Pose refinement with differentiable rendering
☆10Dec 27, 2020Updated 5 years ago
NVIDIA-AI-IOT / nanoowl
View on GitHub
A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT.
☆442Feb 6, 2025Updated last year
witnessai / Awesome-Open-Vocabulary-Object-Detection
View on GitHub
A curated list of papers, datasets and resources pertaining to open vocabulary object detection.
☆422May 13, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
FrankFundel / SGCond
View on GitHub
☆10Jun 28, 2023Updated 3 years ago
Charles-Xie / awesome-described-object-detection
View on GitHub
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…
☆358Nov 6, 2025Updated 8 months ago
HappyAIWalker / ICCV2023-paper-code
View on GitHub
ICCV2023论文代码汇总
☆18Aug 12, 2023Updated 2 years ago
Yuqifan1117 / CaCao
View on GitHub
This is the official repository for the paper "Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World"…
☆49Mar 12, 2024Updated 2 years ago
Georgelingzj / up-to-date-Vision-Language-Models
View on GitHub
Up-to-date Vision Language Models collection. Mainly focus on computer vision
☆20Feb 9, 2023Updated 3 years ago
tinnunculus / Mask2Former
View on GitHub
☆10May 26, 2022Updated 4 years ago
seanzhuh / Awesome-Open-Vocabulary-Detection-and-Segmentation
View on GitHub
Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
☆219Apr 3, 2025Updated last year
ludc506 / InternVL-X
View on GitHub
☆16Mar 26, 2025Updated last year
causalNLP / amr_llm
View on GitHub
This repo explores how AMR to address tasks difficult for LLMs
☆13Jan 15, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
DataXujing / YOLOv12-TensorRT
View on GitHub
YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现
☆14Mar 5, 2025Updated last year
ChenShawn / MultiModal-Jupyter-Sandbox
View on GitHub
Simple code sandbox supporting jupyter notebook style code execution. Used for agent training
☆25Dec 5, 2025Updated 7 months ago
windowsub0406 / 3D-Lidar-visualization-VLP-16
View on GitHub
visualization program for vlp-16 based on a viz class
☆11Feb 8, 2017Updated 9 years ago
Ruggero1912 / Patch-ioner
View on GitHub
[CVPR 2026] Official Repository of the Paper "One Patch to Caption Them All A Unified Zero-Shot Captioning Framework"
☆15Jun 4, 2026Updated last month
emanuelevivoli / ComiCap
View on GitHub
[ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"
☆15Nov 20, 2024Updated last year
ngonthier / Mi_max
View on GitHub
Multi Instance Perceptron for weakly supervised transfer learning of deep detector - Weakly Supervised Object Detection in Artworks
☆16May 29, 2024Updated 2 years ago
iLearn-Lab / CVPR22-SHA-GCL-for-SGG
View on GitHub
Code for paper "Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation"
☆39Apr 8, 2026Updated 3 months ago
weningerleon / BraTS2018
View on GitHub
☆16Feb 3, 2020Updated 6 years ago
LutingWang / OADP
View on GitHub
Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection
☆64Jan 6, 2026Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Hugo0512 / Spinecube
View on GitHub
☆14Jun 28, 2025Updated last year
aimagelab / DICE
View on GitHub
[ICCV 2025] What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models
☆16Nov 3, 2025Updated 8 months ago
yuhangzang / ContextDET
View on GitHub
Contextual Object Detection with Multimodal Large Language Models
☆261Oct 14, 2024Updated last year
soumyadbanik / object-detection-on-aerial-videos
View on GitHub
This repo contains the codes and steps to perform object detection on stanford drone dataset
☆16Dec 27, 2023Updated 2 years ago
waveshare / JETANK
View on GitHub
JETANK is an open-source robot based on NVIDIA Jetson Nano
☆13Jul 30, 2021Updated 4 years ago
Annusha / LCwoF
View on GitHub
Generalized and Incremental Few-Shot Learning by Explicit Learning and Calibration without Forgetting, (ICCV'21)
☆14Aug 4, 2022Updated 3 years ago
jimazeyu / franka_grasp_baseline
View on GitHub
Table top manipulation calibration between the robot arm, the fixed cameras and the camera in hand.
☆13Apr 12, 2024Updated 2 years ago
wenyi5608 / GroundingDINO
View on GitHub
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
☆46Sep 26, 2023Updated 2 years ago
longzw1997 / Open-GroundingDino
View on GitHub
This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detectio…
☆840Jul 27, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Ernestchenchen / CurvNet
View on GitHub
CurvNet: Latent Contour Representation and Iterative Data Engine for Curvature Angle Estimation
☆15Oct 4, 2025Updated 9 months ago
baoqianyue / ImageProcessSamples
View on GitHub
Some examples of image processing based on Opencv
☆17Feb 22, 2019Updated 7 years ago
tgxs002 / CORA
View on GitHub
A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023
☆202Apr 16, 2023Updated 3 years ago
lukasknobel / SelfCollages
View on GitHub
Learning to Count without Annotations
☆23May 24, 2024Updated 2 years ago
UX-Decoder / DINOv
View on GitHub
[CVPR 2024] Official implementation of the paper "Visual In-context Learning"
☆542Apr 8, 2024Updated 2 years ago
I-Am-Timothy-Williams / scoliosis-detection
View on GitHub
First training data to classify spine x-rays as normal, scoliosis and spondylosis. Then trying to use image segmentation techniques to se…
☆16Sep 18, 2024Updated last year
jianzongwu / Awesome-Open-Vocabulary
View on GitHub
(TPAMI 2024) A Survey on Open Vocabulary Learning
☆999May 12, 2026Updated 2 months ago