w-yi / DiagNet
Code for DiagNet: Bridging Text and Image
☆10Updated 6 years ago
Alternatives and similar repositories for DiagNet
Users who are interested in DiagNet are comparing it to the libraries listed below.
- Transfer Learning via Unsupervised Task Discovery for Visual Question Answering☆31Updated 6 years ago
- ☆11Updated 7 years ago
- Code for Unsupervised Discovery of Multimodal Links in Multi-Image/Multi-Sentence Documents☆30Updated 4 years ago
- hierarchical convolutional attention networks for text classification☆16Updated 5 years ago
- PyTorch Implementation of NeurIPS 2020 paper "Learning Sparse Prototypes for Text Generation"☆22Updated 3 years ago
- Cross-modal Coherence Modeling for Caption Generation☆11Updated 4 years ago
- ☆26Updated 5 years ago
- Transfer Learning via Unsupervised Task Discovery for Visual Question Answering☆19Updated 6 years ago
- Good practices in VQA systems, such as POS-tag attention, structured triplet learning, and triplet attention, are very general and can be…☆19Updated 7 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Updated last year
- The implementation of multi-branch attentive Transformer (MAT).☆33Updated 4 years ago
- Probing task; contextual embeddings -> textual definitions (EMNLP19)☆11Updated 4 years ago
- Generate a denotation graph from a set of image captions☆15Updated 6 years ago
- [EMNLP 2021] Code and data for our paper "Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers…☆20Updated 3 years ago
- Code for paper "Knowledgeable Storyteller: A Commonsense-Driven Generative Model for Visual Storytelling"☆7Updated 5 years ago
- https://arxiv.org/abs/1707.00836☆21Updated 7 years ago
- Code and Data for the paper Investigating Evaluation of Open-Domain Dialogue Systems With Human Generated Multiple References SIGdial 201…☆28Updated 5 years ago
- Visual Navigation with Natural Multimodal Assistance (EMNLP 2019)☆28Updated 4 years ago
- Code associated with the "Natural Language Rationales with Full-Stack Visual Reasoning" EMNLP Findings 2020 paper☆24Updated 4 years ago
- Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks☆13Updated last year
- Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"☆32Updated 4 years ago
- PyTorch version of VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer (NeurIPS 2021)☆56Updated 2 years ago
- Models and Codes for the paper Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions☆14Updated 6 years ago
- Repository for our ICLR 2019 paper: Discovery of Natural Language Concepts in Individual Units of CNNs☆26Updated 6 years ago
- implement n2nmn with pytorch☆19Updated 6 years ago
- Dataset and models for paper "Game-Based Video-Context Dialogue (EMNLP 2018)"☆19Updated 6 years ago
- GASP! Dataset - Generating Abstracts of Scientific Papers from Abstracts of Cited Papers☆9Updated 5 years ago
- ADVISE model for the Automatic Understanding of Visual Advertisements challenge. Please refer to our ECCV paper "ADVISE: Symbolism and Ex…☆26Updated 6 years ago
- Source Code for paper "Learning from Explanations with Neural Execution Tree", ICLR 2020☆18Updated 4 years ago
- Tools for training pytorch language models☆27Updated 4 years ago