w-yi / DiagNetLinks

Code for DiagNet: Bridging Text and Image

☆10

Alternatives and similar repositories for DiagNet

Users that are interested in DiagNet are comparing it to the libraries listed below

Sorting:

HyeonwooNoh / VQA-Transfer-ExternalData
Transfer Learning via Unsupervised Task Discovery for Visual Question Answering
☆19Updated 6 years ago
HyeonwooNoh / vqa_task_discovery
Transfer Learning via Unsupervised Task Discovery for Visual Question Answering
☆31Updated 6 years ago
jmhessel / multi-retrieval
Code for Unsupervised Discovery of Multimodal Links in Multi-Image/Multi-Sentence Documents
☆30Updated 4 years ago
niansong1996 / wassp
Official code for AAAI'20 paper "Merging Weak and Active Supervision for Semantic Parsing"
☆11Updated 2 years ago
jxhe / sparse-text-prototype
PyTorch Implementation of NeurIPS 2020 paper "Learning Sparse Prototypes for Text Generation"
☆22Updated 3 years ago
wangzheallen / STL-VQA
The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…
☆19Updated 7 years ago
ishalyminov / babi_tools
Augmentation scripts for the bAbI Dialog Tasks dataset
☆13Updated 6 years ago
aylai / DenotationGraph
Generate a denotation graph from a set of image captions
☆15Updated 6 years ago
alexnowakvila / DiCoNet
☆11Updated 7 years ago
ShaojieJiang / tldr
Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"
☆10Updated last year
YuJiang01 / n2nmn_pytorch
implement n2nmn with pytorch
☆19Updated 6 years ago
arijitray1993 / VQARelevance
Models and Codes for the paper Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions
☆14Updated 6 years ago
microsoft / GEM
☆24Updated 4 years ago
yekeren / ADVISE-Image_ads_understanding
ADVISE model for the Automatic Understanding of Visual Advertisements challenge. Please refer to our ECCV paper "ADVISE: Symbolism and Ex…
☆26Updated 6 years ago
yongqyu / hcan-pytorch
hierarchical convolutional attention networks for text classification
☆16Updated 5 years ago
UKPLab / refresh2018-predicting-trends-from-arxiv
☆26Updated 5 years ago
ofirpress / PartialShuffle
☆14Updated 6 years ago
MurtyShikhar / ExpBERT
Code for our ACL '20 paper "Representation Engineering with Natural Language Explanations"
☆29Updated 5 years ago
seilna / CNN-Units-in-NLP
Repository for our ICLR 2019 paper: Discovery of Natural Language Concepts in Individual Units of CNNs
☆26Updated 6 years ago
intersun / CoDIR
Code for EMNLP 2020 paper CoDIR
☆41Updated 2 years ago
VisionLearningGroup / MULE
Implementation of "MULE: Multimodal Universal Language Embedding"
☆16Updated 5 years ago
acmi-lab / pretraining-with-nonsense
Pretraining summarization models using a corpus of nonsense
☆13Updated 3 years ago
wenhuchen / GPT2-Logic2Text
The code for Template-GPT-2 Generation Model for Logic2Text Dataset
☆18Updated 5 years ago
simonjisu / pytorch_tutorials
some tutorials for blog: simonjisu.github.io
☆23Updated 4 years ago
PurdueMINDS / GNNsMiscalibrated
☆16Updated 5 years ago
naver / aqm-plus
PyTorch code for Large-Scale Answerer in Questioner's Mind for Visual Dialog Question Generation (AQM+) (ICLR 2019)
☆50Updated 6 years ago
MiuLab / GenDef
Probing task; contextual embeddings -> textual definitions (EMNLP19)
☆11Updated 4 years ago
UKPLab / emnlp2018-april
☆12Updated 6 years ago
e-bug / cross-modal-ablation
[EMNLP 2021] Code and data for our paper "Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers…
☆20Updated 3 years ago
yekeren / Story-Video_ads_understanding
LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".
☆14Updated 4 years ago