noagarcia / ArtVQA
AQUA dataset and VIKING model for the task of Art Visual Question Answering
☆23Updated 3 years ago
Alternatives and similar repositories for ArtVQA:
Users that are interested in ArtVQA are comparing it to the libraries listed below
- source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT☆73Updated 2 years ago
- kdexd/coco-caption@de6f385☆26Updated 4 years ago
- CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification - 4th Workshop on Computer Vision for Fashion, Art, and Design☆27Updated 2 years ago
- ☆50Updated 2 years ago
- Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021☆19Updated 3 years ago
- Command-line tool for downloading and extending the RedCaps dataset.☆46Updated last year
- Tensorflow Implementation on Paper [CVPR2020]Image Search with Text Feedback by Visiolinguistic Attention Learning☆63Updated 4 years ago
- Multi-sense word embeddings from visual co-occurrences☆25Updated 5 years ago
- A large-scale dataset for instance-level recognition for artworks is introduced.☆48Updated last year
- This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in th…☆62Updated 2 years ago
- A dataset of crowdsourced ratings for machine-generated image captions☆35Updated 5 years ago
- Data of ACL 2019 Paper "Expressing Visual Relationships via Language".☆62Updated 4 years ago
- [CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning☆90Updated 10 months ago
- Data Release for VALUE Benchmark☆31Updated 3 years ago
- ☆34Updated last year
- Pre-trained V+L Data Preparation☆45Updated 4 years ago
- Starter Code for VALUE benchmark☆80Updated 2 years ago
- ☆42Updated 3 years ago
- A length-controllable and non-autoregressive image captioning model.☆68Updated 3 years ago
- Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)☆17Updated 2 years ago
- ☆73Updated 2 years ago
- Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO☆51Updated 4 years ago
- Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o…☆37Updated 7 months ago
- Dense video captioning in PyTorch☆41Updated 5 years ago
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training☆134Updated last year
- ☆35Updated last year
- L-Verse: Bidirectional Generation Between Image and Text☆108Updated 2 years ago
- ☆32Updated 2 years ago
- RareAct: A video dataset of unusual interactions☆32Updated 4 years ago
- This is an official pytorch implementation of Learning To Recognize Procedural Activities with Distant Supervision. In this repository, w…☆41Updated 2 years ago