adobe-research / vaw_dataset
This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in the Wild" and the ECCV 2022 paper titled "Improving Closed and Open-Vocabulary Attribute Prediction using Transformers".
☆63Updated 2 years ago
Alternatives and similar repositories for vaw_dataset:
Users who are interested in vaw_dataset are comparing it to the repositories listed below:
- ☆64Updated last year
- [CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning☆66Updated 2 years ago
- A weakly-supervised scene graph generation codebase. The implementation of our CVPR 2021 paper "Linguistic Structures as Weak Supervision…☆37Updated 3 years ago
- [ICCV 2021] Official code for "Learning to Generate Scene Graph from Natural Language Supervision"☆100Updated 2 years ago
- ☆56Updated 2 years ago
- [ICCV 2021] Official PyTorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and …☆62Updated 3 years ago
- ☆26Updated last year
- ☆83Updated 3 years ago
- "Describing Textures using Natural Language" code and data, ECCV 2020 Oral.☆17Updated 4 years ago
- Official code for "Disentangling Visual Embeddings for Attributes and Objects", published at CVPR 2022☆35Updated last year
- Code for the CVPR 2020 oral paper: Weakly Supervised Visual Semantic Parsing☆35Updated 2 years ago
- A PyTorch implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels☆60Updated 2 years ago
- This repo is the official implementation of UPL (Unsupervised Prompt Learning for Vision-Language Models).☆114Updated 3 years ago
- kdexd/coco-caption@de6f385☆26Updated 4 years ago
- Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"☆110Updated 4 years ago
- ☆26Updated last year
- ☆26Updated 3 years ago
- Official Implementation of CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback presented in CVPR 2021.☆66Updated 2 years ago
- ☆29Updated last year
- ☆59Updated 3 years ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆18Updated 2 years ago
- RareAct: A video dataset of unusual interactions☆32Updated 4 years ago
- This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described …☆68Updated 3 years ago
- ☆50Updated 2 years ago
- [ECCV 2022] Grounding Visual Representations with Texts for Domain Generalization☆31Updated 2 years ago
- ☆53Updated 3 years ago
- Official implementation for the paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding", NeurIPS 2021☆66Updated 2 years ago
- A Python toolkit for the OmniLabel benchmark providing code for evaluation and visualization☆21Updated 2 months ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆32Updated last year
- Reliably download millions of images efficiently☆115Updated 4 years ago