Data of ACL 2019 Paper "Expressing Visual Relationships via Language".
☆62Sep 30, 2020Updated 5 years ago
Alternatives and similar repositories for VisualRelationships
Users that are interested in VisualRelationships are comparing it to the libraries listed below
Sorting:
- Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space☆60Apr 5, 2018Updated 7 years ago
- A simple but well-performing "single-hop" visual attention model for the GQA dataset☆20Aug 8, 2019Updated 6 years ago
- [EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction☆51Aug 20, 2022Updated 3 years ago
- Inferring and Executing Programs for Visual Reasoning☆21Jan 4, 2019Updated 7 years ago
- [ACL 2019] Visually Grounded Neural Syntax Acquisition☆90Feb 24, 2024Updated 2 years ago
- Pre-trained V+L Data Preparation☆46Jun 2, 2020Updated 5 years ago
- Website for TextVQA dataset.☆28Apr 30, 2023Updated 2 years ago
- logboard: Monitor and Compare Logs on Browser/Terminal.☆21Sep 19, 2019Updated 6 years ago
- Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"☆64Mar 24, 2023Updated 2 years ago
- Starter code for the VMT task and challenge☆51Jul 29, 2020Updated 5 years ago
- ☆30Oct 2, 2018Updated 7 years ago
- Code for CVPR'18 "Grounding Referring Expressions in Images by Variational Context"☆30Jul 4, 2018Updated 7 years ago
- CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment☆22Apr 15, 2022Updated 3 years ago
- BISON: Binary Image SelectiON☆49Sep 15, 2021Updated 4 years ago
- Visaul Question Generation as Dual Task of Visual Question Answering (PyTorch Version)☆82Jun 15, 2018Updated 7 years ago
- Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…☆21Oct 20, 2020Updated 5 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020☆81Jul 17, 2020Updated 5 years ago
- Code for the paper "Representation Learning for Grounded Spatial Reasoning"☆52Jul 2, 2020Updated 5 years ago
- EMNLP 2018. Learning to Describe Differences Between Pairs of Similar Images. Harsh Jhamtani, Taylor Berg-Kirkpatrick.☆67Jan 27, 2026Updated last month
- Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks. in ECCV, 2018☆71Nov 17, 2019Updated 6 years ago
- Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding☆23Jun 27, 2018Updated 7 years ago
- ☆478Nov 21, 2022Updated 3 years ago
- Implementation for the AAAI2019 paper "Large-scale Visual Relationship Understanding"☆146Sep 3, 2019Updated 6 years ago
- video captioning☆24Mar 14, 2019Updated 6 years ago
- Rethinking the Form of Latent States in Image Captioning☆20Aug 31, 2018Updated 7 years ago
- Domain Agnostic Normalization layer for Unsupervised Domain Adaptation☆11Dec 8, 2022Updated 3 years ago
- An unofficial PyTorch implementation of the HAN and AdaHAN models presented in the "Learning Visual Question Answering by Bootstrapping H…☆54Sep 1, 2018Updated 7 years ago
- An attempt at a PyTorch Implementation of "Zero-Shot" Super-Resolution using Deep Internal Learning by Shocher et al. CVPR 2018☆14Aug 30, 2018Updated 7 years ago
- Use transformer for captioning☆156May 2, 2019Updated 6 years ago
- Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…☆59Mar 24, 2023Updated 2 years ago
- Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)☆100Oct 17, 2022Updated 3 years ago
- Referring expression comprehension on ReferIt(RefClef)☆10Nov 28, 2016Updated 9 years ago
- Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference"☆161Apr 29, 2020Updated 5 years ago
- Baseline model for nocaps benchmark, ICCV 2019 paper "nocaps: novel object captioning at scale".☆77Oct 3, 2023Updated 2 years ago
- Models and Codes for the paper Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions☆14Aug 6, 2018Updated 7 years ago
- VisualCOMET: Reasoning about the Dynamic Context of a Still Image☆88Jun 12, 2023Updated 2 years ago
- Neural Machine Translation with universal Visual Representation (ICLR 2020)☆90Jul 1, 2020Updated 5 years ago
- Pytorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671☆18May 6, 2021Updated 4 years ago
- A length-controllable and non-autoregressive image captioning model.☆69Jun 10, 2021Updated 4 years ago