object detection based on owl-vit
☆67Aug 18, 2023Updated 2 years ago
Alternatives and similar repositories for owl-vit-object-detection
Users that are interested in owl-vit-object-detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detec…☆67Apr 4, 2025Updated last year
- Capstone Project: Training and Finetuning for OWL ViT for Referring Expression Task☆12Jan 13, 2024Updated 2 years ago
- Code for paper 'Zero-Shot Scene Graph Generation via Triplet Calibration and Reduction' (TOMM 2023)☆10Sep 6, 2025Updated 8 months ago
- A curated list of papers, datasets and resources pertaining to open vocabulary object detection.☆419May 13, 2025Updated last year
- Pytorch Implementation of Deepmind's SIMA: "Scaling Instructable Agents Across Many Simulated Worlds"☆34Jun 17, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…☆354Nov 6, 2025Updated 6 months ago
- Up-to-date Vision Language Models collection. Mainly focus on computer vision☆19Feb 9, 2023Updated 3 years ago
- ☆10Jun 28, 2023Updated 2 years ago
- This is the official repository for the paper "Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World"…☆49Mar 12, 2024Updated 2 years ago
- ☆16Mar 26, 2025Updated last year
- Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future☆219Apr 3, 2025Updated last year
- YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现☆13Mar 5, 2025Updated last year
- ☆10May 26, 2022Updated 4 years ago
- [ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"☆14Nov 20, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Context-Guided Prompt Learning and Attention Refinement for Zero-Shot Anomaly Detection☆34Aug 9, 2025Updated 9 months ago
- Code for paper 'Leveraging Predicate and Triplet Learning for Scene Graph Generation'. (CVPR 2024)☆32Sep 6, 2025Updated 8 months ago
- Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection☆64Jan 6, 2026Updated 4 months ago
- ☆12Aug 19, 2023Updated 2 years ago
- A hot-pluggable tool for visualizing LLaVA's attention.☆24Jan 29, 2024Updated 2 years ago
- Sample notebooks that show the usage of Data Explorer SDK(akride)☆18Jul 24, 2024Updated last year
- Code for paper "Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation"☆39Apr 8, 2026Updated last month
- ☆19Jan 30, 2023Updated 3 years ago
- Multi Instance Perceptron for weakly supervised transfer learning of deep detector - Weakly Supervised Object Detection in Artworks☆16May 29, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Contextual Object Detection with Multimodal Large Language Models☆260Oct 14, 2024Updated last year
- ENACT is a benchmark that evaluates embodied cognition through world modeling from egocentric interaction. It is designed to be simple an…☆50Nov 27, 2025Updated 6 months ago
- Official code for the paper "Housekeep: Tidying Virtual Households using Commonsense Reasoning" published at ECCV, 2022☆52Apr 27, 2023Updated 3 years ago
- Table top manipulation calibration between the robot arm, the fixed cameras and the camera in hand.☆13Apr 12, 2024Updated 2 years ago
- Real-Time Semantic Segmentation of Street Scenes☆31Dec 4, 2023Updated 2 years ago
- Scenic: A Jax Library for Computer Vision Research and Beyond☆3,803May 8, 2026Updated 3 weeks ago
- ☆12Feb 16, 2023Updated 3 years ago
- NOPA: Neurally-guided Online Probabilistic Assistance for Building Socially Intelligent Home Assistants☆12Mar 12, 2023Updated 3 years ago
- Learning to Count without Annotations☆23May 24, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models☆50Jan 8, 2025Updated last year
- A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023☆201Apr 16, 2023Updated 3 years ago
- Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"☆46Sep 26, 2023Updated 2 years ago
- ☆38Nov 25, 2025Updated 6 months ago
- [CVPR 2024] Official implementation of the paper "Visual In-context Learning"☆538Apr 8, 2024Updated 2 years ago
- (TPAMI 2024) A Survey on Open Vocabulary Learning☆999May 12, 2026Updated 2 weeks ago
- Collect papers about Mamba (a selective state space model).☆15Aug 6, 2024Updated last year