Eurus-Holmes / Pythia-VQALinks

Baseline for Visual Question Answering.

☆8

Alternatives and similar repositories for Pythia-VQA

Users that are interested in Pythia-VQA are comparing it to the libraries listed below

Sorting:

yuewang-cuhk / CMKP
Official code and data for EMNLP 2020 paper "Cross-Media Keyphrase Prediction: A Unified Framework with Multi-Modality Multi-Head Attenti…
☆21Updated 4 years ago
Eurus-Holmes / MNMT
Pytorch implementation of Multimodal Neural Machine Translation(MNMT).
☆12Updated 4 years ago
cooelf / UVR-NMT
Neural Machine Translation with universal Visual Representation (ICLR 2020)
☆88Updated 5 years ago
lancopku / livebot
LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts (AAAI 2019)
☆122Updated 6 years ago
hongwang600 / Summarization
☆38Updated 5 years ago
yunjey / seq2seq-dataloader
PyTorch DataLoader for seq2seq
☆85Updated 6 years ago
JXZe / DualVD
☆77Updated 2 years ago
microsoft / EA-VQ-VAE
This repo provides the code for the ACL 2020 paper "Evidence-Aware Inferential Text Generation with Vector Quantised Variational AutoEnco…
☆55Updated 4 years ago
shubhamagarwal92 / mmd
This repository contains the Pytorch implementation for our SCAI (EMNLP-2018) submission "A Knowledge-Grounded Multimodal Search-Based Co…
☆29Updated 5 years ago
karunraju / VQA
Hierarchical Question-Image Co-Attention for Visual Question Answering
☆24Updated 6 years ago
berniebear / Multi-HT100M
☆53Updated 3 years ago
lichengunc / vist_eval
vist story telling evaluation tool
☆21Updated last year
ictnlp / DSTC8-AVSD
We rank the 1st in DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD…
☆56Updated 2 years ago
henryhungle / MTN
Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)
☆100Updated 2 years ago
eric-xw / AREL
Code for the ACL paper "No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling"
☆136Updated 4 years ago
TensorUI / relative-position-pytorch
a pytorch implementation of self-attention with relative position representations
☆50Updated 4 years ago
nurpeiis / LeakGAN-PyTorch
A simple implementation of LeakGAN in PyTorch
☆63Updated 3 years ago
lingyongyan / Neural-Machine-Translation
PyTorch implementation of "Effective Approaches to Attention-based Neural Machine Translation" using scheduled sampling to improve the pa…
☆38Updated 7 years ago
cvlab-tohoku / Dense-CoAttention-Network
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
☆106Updated 5 years ago
gicheonkang / dan-visdial
✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"
☆45Updated 2 years ago
zychen423 / KE-VIST
The code and output of our AAAI paper "Knowledge-Enriched Visual Storytelling"
☆40Updated 4 years ago
lynnna-xu / text-generation-transformer
text generation based on transformer
☆36Updated 6 years ago
yaushian / Tree-Transformer
Implementation of the paper Tree Transformer
☆214Updated 5 years ago
hrlinlp / cepsum
☆44Updated 3 years ago
hudaAlamri / DSTC7-Audio-Visual-Scene-Aware-Dialog-AVSD-Challenge
☆53Updated 5 years ago
CharizardAcademy / convtransformer
Code for the ACL2020 paper Character-Level Translation with Self-Attention
☆31Updated 4 years ago
jiachenwestlake / Multi-Cell_LSTM
Multi-cell compositional LSTM for NER domain adaptation, code for ACL 2020 paper
☆30Updated 4 years ago
ranjaykrishna / iq
Information Maximizing Visual Question Generation
☆66Updated last year
tkim-snu / GLACNet
GLAC Net: GLocal Attention Cascading Network for the Visual Storytelling Challenge
☆45Updated 4 years ago
voidism / Transformer_CycleGAN_Text_Style_Transfer-pytorch
Implementation of CycleGAN for Text style transfer with PyTorch.
☆32Updated 5 years ago