adobe-research / vaw_dataset
This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in the Wild" and the ECCV 2022 paper titled "Improving Closed and Open-Vocabulary Attribute Prediction using Transformers"
☆63Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for vaw_dataset
- ☆65Updated last year
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022☆33Updated last year
- Shared Attention for Multi-label Zero-shot Learning accepted @ CVPR20☆31Updated 2 years ago
- [ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and …☆60Updated 2 years ago
- [CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning☆64Updated 2 years ago
- ☆25Updated last year
- ☆56Updated 2 years ago
- This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described …☆64Updated 2 years ago
- ☆24Updated 3 years ago
- Official Implementation of CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback presented in CVPR 2021.☆64Updated 2 years ago
- [ECCV-2022]Grounding Visual Representations with Texts for Domain Generalization☆31Updated last year
- ☆29Updated last year
- ImageNet-CoG is a benchmark for concept generalization. It provides a full evaluation framework for pre-trained visual representations wh…☆24Updated 3 years ago
- The Pytorch implementation for "Video-Text Pre-training with Learned Regions"☆42Updated 2 years ago
- [NeurIPS 2022 Spotlight] RLIP: Relational Language-Image Pre-training and a series of other methods to solve HOI detection and Scene Grap…☆71Updated 5 months ago
- RareAct: A video dataset of unusual interactions☆32Updated 4 years ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆32Updated last year
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021☆65Updated 2 years ago
- [CVPR 2022] The code for our paper 《Object-aware Video-language Pre-training for Retrieval》☆62Updated 2 years ago
- ☆35Updated last year
- [ICCV 2021] Official code for "Learning to Generate Scene Graph from Natural Language Supervision"☆100Updated last year
- [ECCV'22 Poster] Explicit Image Caption Editing☆21Updated last year
- Learning phrase grounding from captioned images through InfoNCE bound on mutual information☆72Updated 4 years ago
- [CVPR 2021] Exemplar-Based Open-Set Panoptic Segmentation Network (EOPSN)☆52Updated 2 years ago
- "Describing Textures using Natural Language" code and data, ECCV 2020 Oral.☆17Updated 4 years ago
- Transformation Driven Visual Reasoning - CVPR 2021☆34Updated last year
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆18Updated 2 years ago
- ☆58Updated 2 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆22Updated 5 months ago
- ☆81Updated 2 years ago