adobe-research / vaw_datasetLinks
This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in the Wild" and the ECCV 2022 paper titled "Improving Closed and Open-Vocabulary Attribute Prediction using Transformers"
☆66Updated 2 years ago
Alternatives and similar repositories for vaw_dataset
Users that are interested in vaw_dataset are comparing it to the libraries listed below
Sorting:
- [ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and …☆62Updated 3 years ago
- [ECCV-2022]Grounding Visual Representations with Texts for Domain Generalization☆31Updated 2 years ago
- ☆27Updated last year
- ☆64Updated last year
- [ICCV 2021] Official code for "Learning to Generate Scene Graph from Natural Language Supervision"☆101Updated 2 years ago
- ☆56Updated last month
- Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"☆111Updated 5 years ago
- Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models☆46Updated last year
- ☆83Updated 3 years ago
- ☆59Updated 3 years ago
- A Python toolkit for the OmniLabel benchmark providing code for evaluation and visualization☆21Updated 4 months ago
- kdexd/coco-caption@de6f385☆26Updated 5 years ago
- A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision…☆37Updated 4 years ago
- [CVPR 2022] The code for our paper 《Object-aware Video-language Pre-training for Retrieval》☆62Updated 3 years ago
- ☆26Updated last year
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021☆67Updated 3 years ago
- This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described …☆69Updated 3 years ago
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022☆35Updated last year
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆32Updated 2 years ago
- [CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》☆151Updated last year
- The Pytorch implementation for "Video-Text Pre-training with Learned Regions"☆42Updated 2 years ago
- Code for the CVPR 2020 oral paper: Weakly Supervised Visual Semantic Parsing☆35Updated 2 years ago
- This repo is the official implementation of UPL (Unsupervised Prompt Learning for Vision-Language Models).☆116Updated 3 years ago
- Shared Attention for Multi-label Zero-shot Learning accepted @ CVPR20☆32Updated 3 years ago
- [SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval. Also, a text-video retrieval toolbox based on CLIP + fast p…☆131Updated 3 years ago
- ☆50Updated 2 years ago
- 📍 Official pytorch implementation of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)☆52Updated last year
- UniTAB: Unifying Text and Box Outputs for Grounded VL Modeling, ECCV 2022 (Oral Presentation)☆87Updated last year
- ☆26Updated 3 years ago
- ☆30Updated last year