OVAD: Open-vocabulary Attribute Detection code
☆31Aug 28, 2023Updated 2 years ago
Alternatives and similar repositories for ovad-benchmark-code
Users who are interested in ovad-benchmark-code are comparing it to the repositories listed below
- Official implementation of TagAlign☆37Dec 11, 2024Updated last year
- This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in th…☆69Jul 22, 2022Updated 3 years ago
- This is the official repository of OCL (ICCV 2023).☆26Mar 28, 2024Updated last year
- [ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation…☆39Feb 24, 2025Updated last year
- [AAAI 2026] Relation-R1: Progressively Cognitive Chain-of-Thought Guided Reinforcement Learning for Unified Relation Comprehension☆18Mar 6, 2026Updated 2 weeks ago
- Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models☆47Sep 25, 2023Updated 2 years ago
- Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency☆62Jun 6, 2025Updated 9 months ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆80Oct 25, 2024Updated last year
- Code used to create the COCO Attributes dataset and experiments in the associated ECCV 2016 paper.☆49Dec 26, 2022Updated 3 years ago
- Official PyTorch implementation of the Transformer-based PAUP model for sequential recommendation, SIGIR 2022☆10Sep 8, 2022Updated 3 years ago
- [ICLR 2024 Oral] Less is More: Fewer Interpretable Region via Submodular Subset Selection☆88Oct 27, 2025Updated 4 months ago
- ☆25Aug 1, 2023Updated 2 years ago
- Learning from Noisy Anchors for One-stage Object Detection☆27Apr 14, 2021Updated 4 years ago
- A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)☆62May 7, 2024Updated last year
- ☆99Jun 23, 2025Updated 8 months ago
- This repository contains the implementation and the building blocks of GlideNet and Informed Convolution. This work is published at CVPR …☆29Mar 7, 2022Updated 4 years ago
- [NeurIPS 2024 Spotlight] code for "Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement"☆19Jan 26, 2025Updated last year
- A PyTorch implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels☆64Mar 27, 2023Updated 2 years ago
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"☆38Aug 18, 2024Updated last year
- Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation☆62Jan 15, 2025Updated last year
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 2 years ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning☆62Oct 24, 2025Updated 4 months ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆26May 31, 2025Updated 9 months ago
- This repository contains the source codes for the paper: "SPACE: A Simulator for Physical Interactions and Causal Learning in 3D Environm…☆16Oct 11, 2021Updated 4 years ago
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization☆24Mar 6, 2026Updated 2 weeks ago
- ☆16Oct 3, 2023Updated 2 years ago
- Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection☆64Jan 6, 2026Updated 2 months ago
- ☆10Aug 22, 2023Updated 2 years ago
- Implementation of "Spectral Feature Transformation for Person Re-identification"☆31Sep 7, 2019Updated 6 years ago
- Exploiting unlabeled data with vision and language models for object detection, ECCV 2022☆94Jan 16, 2024Updated 2 years ago
- ☆18Aug 1, 2025Updated 7 months ago
- ☆21Jun 6, 2024Updated last year
- [CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want☆14Jan 5, 2025Updated last year
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio…☆28Nov 8, 2023Updated 2 years ago
- This repository is an official PyTorch implementation of SD-GAN: Semantic Decomposition for Face Image Synthesis with Discrete Attribute.☆13Mar 18, 2024Updated 2 years ago
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners☆45Jun 14, 2023Updated 2 years ago
- [Pattern Recognition 25] CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks☆467Mar 1, 2025Updated last year
- Diffusion Models Tutorials☆15Apr 10, 2023Updated 2 years ago
- CVPR 2024 Official Repository☆12Mar 27, 2024Updated last year