adobe-research / vaw_dataset
This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in the Wild" and the ECCV 2022 paper titled "Improving Closed and Open-Vocabulary Attribute Prediction using Transformers"
☆62Updated 2 years ago
Alternatives and similar repositories for vaw_dataset:
Users that are interested in vaw_dataset are comparing it to the libraries listed below
- ☆64Updated last year
- [ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and …☆60Updated 3 years ago
- Learning phrase grounding from captioned images through InfoNCE bound on mutual information☆72Updated 4 years ago
- Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"☆109Updated 4 years ago
- A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision…☆37Updated 3 years ago
- ☆59Updated 3 years ago
- [ICCV 2021] Official code for "Learning to Generate Scene Graph from Natural Language Supervision"☆99Updated last year
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022☆35Updated last year
- ☆25Updated last year
- ☆25Updated 3 years ago
- ☆29Updated last year
- The Pytorch implementation for "Video-Text Pre-training with Learned Regions"☆42Updated 2 years ago
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021☆66Updated 2 years ago
- "Describing Textures using Natural Language" code and data, ECCV 2020 Oral.☆17Updated 4 years ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆18Updated 2 years ago
- This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described …☆68Updated 3 years ago
- PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision☆44Updated 4 years ago
- AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"☆57Updated 3 years ago
- This repository contains the code for our CVPR 2022 paper on "Integrating Language Guidance into Vision-based Deep Metric Learning".☆43Updated 2 years ago
- Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o…☆37Updated 7 months ago
- A curated list of research papers in Referring Expression Comprehension (REC)☆43Updated 3 years ago
- [CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning☆65Updated 2 years ago
- ☆56Updated 2 years ago
- Shared Attention for Multi-label Zero-shot Learning accepted @ CVPR20☆32Updated 3 years ago
- A Python toolkit for the OmniLabel benchmark providing code for evaluation and visualization☆21Updated 3 weeks ago
- RareAct: A video dataset of unusual interactions☆32Updated 4 years ago
- [ECCV'22 Poster] Explicit Image Caption Editing☆21Updated 2 years ago
- ☆35Updated last year
- Using LLMs and pre-trained caption models for super-human performance on image captioning.☆40Updated last year
- A pytorch Implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels☆59Updated last year