OVAD: Open-vocabulary Attribute Detection code
☆31Aug 28, 2023Updated 2 years ago
Alternatives and similar repositories for ovad-benchmark-code
Users that are interested in ovad-benchmark-code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OvarNet official implement of the paper "OvarNet: Towards Open-vocabulary Object Attribute Recognition"☆106Apr 7, 2023Updated 3 years ago
- Official implementation of TagAlign☆37Dec 11, 2024Updated last year
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022☆35Aug 4, 2023Updated 2 years ago
- [ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation…☆39Feb 24, 2025Updated last year
- Introduce a novel Video Trimming (VT) task and proposes an agent-based approach (AVT) for detecting wasted footage, selecting valuable se…☆24Jan 20, 2025Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency☆62Jun 6, 2025Updated 10 months ago
- Code Implementation of "Unsupervised Recognition of Unknown Objects for Open-World Object Detection"☆33Oct 13, 2023Updated 2 years ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆80Oct 25, 2024Updated last year
- Official pyTorch implementation of Transformer-based PAUP model for sequential recommentation, SIGIR 2022☆10Sep 8, 2022Updated 3 years ago
- EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions☆25May 30, 2024Updated last year
- [ICLR 2024 Oral] Less is More: Fewer Interpretable Region via Submodular Subset Selection☆88Oct 27, 2025Updated 5 months ago
- Learning from Noisy Anchors for One-stage Object Detection☆27Apr 14, 2021Updated 4 years ago
- ☆10Sep 29, 2022Updated 3 years ago
- This repository contains the implementation and the building blocks of GlideNet and Informed Convolution. This work is published at CVPR …☆30Mar 7, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [NeurIPS 2024 Spotlight] code for "Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement"☆19Jan 26, 2025Updated last year
- A pytorch Implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels☆64Mar 27, 2023Updated 3 years ago
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"☆38Aug 18, 2024Updated last year
- ☆10Apr 7, 2025Updated last year
- Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation☆62Jan 15, 2025Updated last year
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 2 years ago
- [ICML 2024] Fine-Grained Classes and How to Find Them☆13Jun 21, 2024Updated last year
- [NeurIPS 2023] Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models☆22Oct 21, 2025Updated 5 months ago
- This repository contains the source codes for the paper: "SPACE: A Simulator for Physical Interactions and Causal Learning in 3D Environm…☆16Oct 11, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆26May 31, 2025Updated 10 months ago
- Repository for SF2SE3: Clustering Scene Flow into SE(3)-Motions via Proposal and Selection☆12Jul 26, 2024Updated last year
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization☆24Mar 6, 2026Updated last month
- ☆16Oct 3, 2023Updated 2 years ago
- Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection☆64Jan 6, 2026Updated 3 months ago
- ☆10Aug 22, 2023Updated 2 years ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning☆65Oct 24, 2025Updated 5 months ago
- Exploiting unlabeled data with vision and language models for object detection, ECCV 2022☆94Jan 16, 2024Updated 2 years ago
- [NeurIPS2023] Official implementation of the paper "Large Language Models are Visual Reasoning Coordinators"☆105Nov 9, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆22Jun 6, 2024Updated last year
- ☆19Aug 1, 2025Updated 8 months ago
- [CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want☆14Jan 5, 2025Updated last year
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio…☆28Nov 8, 2023Updated 2 years ago
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners☆45Jun 14, 2023Updated 2 years ago
- [Pattern Recognition 25] CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks☆472Mar 1, 2025Updated last year
- [NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"☆183Mar 4, 2024Updated 2 years ago