airsplay/VisualRelationships

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/airsplay/VisualRelationships)

airsplay / VisualRelationships

Data of ACL 2019 Paper "Expressing Visual Relationships via Language".

☆63

Alternatives and similar repositories for VisualRelationships

Users that are interested in VisualRelationships are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

harsh19 / spot-the-diff
View on GitHub
EMNLP 2018. Learning to Describe Differences Between Pairs of Similar Images. Harsh Jhamtani, Taylor Berg-Kirkpatrick.
☆70Jan 27, 2026Updated 5 months ago
jayleicn / VideoLanguageFuturePred
View on GitHub
[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction
☆52Aug 20, 2022Updated 3 years ago
yiyang92 / vae_captioning
View on GitHub
Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space
☆60Apr 5, 2018Updated 8 years ago
lichengunc / pretrain-vl-data
View on GitHub
Pre-trained V+L Data Preparation
☆47Jun 2, 2020Updated 6 years ago
ExplorerFreda / VGNSL
View on GitHub
[ACL 2019] Visually Grounded Neural Syntax Acquisition
☆90Feb 24, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
ronghanghu / gqa_single_hop_baseline
View on GitHub
A simple but well-performing "single-hop" visual attention model for the GQA dataset
☆20Aug 8, 2019Updated 6 years ago
zmykevin / UVLP
View on GitHub
CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment
☆21Apr 15, 2022Updated 4 years ago
yuleiniu / rva
View on GitHub
Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"
☆64Mar 24, 2023Updated 3 years ago
ccvl / iep-ref
View on GitHub
Inferring and Executing Programs for Visual Reasoning
☆21Jan 4, 2019Updated 7 years ago
zhegan27 / LXMERT-AdvTrain
View on GitHub
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…
☆21Oct 20, 2020Updated 5 years ago
eric-xw / Video-guided-Machine-Translation
View on GitHub
Starter code for the VMT task and challenge
☆51Jul 29, 2020Updated 5 years ago
henryhungle / MTN
View on GitHub
Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)
☆100Oct 17, 2022Updated 3 years ago
daicoolb / Awesome-Video-Captioning
View on GitHub
video captioning
☆24Mar 14, 2019Updated 7 years ago
tgGuo15 / PriorImageCaption
View on GitHub
☆30Oct 2, 2018Updated 7 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
jiasenlu / vilbert_beta
View on GitHub
☆478Nov 21, 2022Updated 3 years ago
LibertFan / ImageCaption
View on GitHub
Bridging by Word: Image-Grounded Vocabulary Construction for Visual Captioning based in ACL2019
☆17Sep 8, 2019Updated 6 years ago
Maluuba / GeNeVA_datasets
View on GitHub
Scripts to generate the CoDraw and i-CLEVR datasets used for the GeNeVA task proposed in our ICCV 2019 paper "Tell, Draw, and Repeat: Gen…
☆41May 16, 2023Updated 3 years ago
wkentaro / logboard
View on GitHub
logboard: Monitor and Compare Logs on Browser/Terminal.
☆21Sep 19, 2019Updated 6 years ago
ruotianluo / Transformer_Captioning
View on GitHub
Use transformer for captioning
☆156May 2, 2019Updated 7 years ago
doubledaibo / 2dcaption_eccv2018
View on GitHub
Rethinking the Form of Latent States in Image Captioning
☆20Aug 31, 2018Updated 7 years ago
Deanplayerljx / tab-vcr
View on GitHub
Pytorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671
☆19May 6, 2021Updated 5 years ago
jimmy646 / violin
View on GitHub
Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference"
☆161Apr 29, 2020Updated 6 years ago
XiangChenchao / DDPN
View on GitHub
Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding
☆23Jun 27, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
facebookresearch / TextVQA
View on GitHub
Website for TextVQA dataset.
☆30Apr 30, 2023Updated 3 years ago
yuleiniu / vc
View on GitHub
Code for CVPR'18 "Grounding Referring Expressions in Images by Variational Context"
☆30Jul 4, 2018Updated 8 years ago
bearcatt / LaBERT
View on GitHub
A length-controllable and non-autoregressive image captioning model.
☆69Jun 10, 2021Updated 5 years ago
chihyaoma / cyclical-visual-captioning
View on GitHub
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
☆46Jul 29, 2020Updated 5 years ago
yikang-li / iQAN
View on GitHub
Visaul Question Generation as Dual Task of Visual Question Answering (PyTorch Version)
☆82Jun 15, 2018Updated 8 years ago
fawazsammani / show-edit-tell
View on GitHub
Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020
☆82Jul 17, 2020Updated 6 years ago
daqingliu / CAVP
View on GitHub
Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Networ…
☆46Jul 27, 2019Updated 6 years ago
MayankSingal / PyTorch-Zero-Shot-Super-Resolution
View on GitHub
An attempt at a PyTorch Implementation of "Zero-Shot" Super-Resolution using Deep Internal Learning by Shocher et al. CVPR 2018
☆14Aug 30, 2018Updated 7 years ago
Unbabel / word-level-qe-corpus-builder
View on GitHub
Builds a WMT18-like corpus for word-level QE with annotations in the source and target words.
☆10Sep 19, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
lichengunc / speaker_listener_reinforcer
View on GitHub
Torch Implementation of Speaker-Listener-Reinforcer for Referring Expression Generation and Comprehension
☆34Mar 8, 2018Updated 8 years ago
gnouhp / PyTorch-AdaHAN
View on GitHub
An unofficial PyTorch implementation of the HAN and AdaHAN models presented in the "Learning Visual Question Answering by Bootstrapping H…
☆54Sep 1, 2018Updated 7 years ago
jz462 / Large-Scale-VRD.pytorch
View on GitHub
Implementation for the AAAI2019 paper "Large-scale Visual Relationship Understanding"
☆145Sep 3, 2019Updated 6 years ago
sushizixin / CLIP4IDC
View on GitHub
CLIP4IDC: CLIP for Image Difference Captioning (AACL 2022)
☆36Nov 12, 2022Updated 3 years ago
RiTUAL-MBZUAI / Font-prediction-dataset
View on GitHub
This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"
☆11May 5, 2020Updated 6 years ago
JaywongWang / CBP
View on GitHub
Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…
☆59Mar 24, 2023Updated 3 years ago
Seth-Park / RobustChangeCaptioning
View on GitHub
Code and dataset release for Park et al., Robust Change Captioning (ICCV 2019)
☆52Dec 8, 2022Updated 3 years ago