henryhungle / MTNView external linksLinks
Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)
☆100Oct 17, 2022Updated 3 years ago
Alternatives and similar repositories for MTN
Users that are interested in MTN are comparing it to the libraries listed below
Sorting:
- Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)☆11Jun 16, 2025Updated 7 months ago
- ✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"☆45Mar 19, 2023Updated 2 years ago
- Dataset and models for paper "Game-Based Video-Context Dialogue (EMNLP 2018)"☆19Oct 25, 2018Updated 7 years ago
- Code for the paper Non-Autoregressive Dialog State Tracking (ICLR20)☆44Feb 25, 2020Updated 5 years ago
- Code for DVD A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue☆14Oct 12, 2021Updated 4 years ago
- Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog☆49Feb 18, 2020Updated 5 years ago
- The implementation of "Learning Deep Transformer Models for Machine Translation"☆116Jul 25, 2024Updated last year
- ☆15Aug 13, 2020Updated 5 years ago
- Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"☆64Mar 24, 2023Updated 2 years ago
- Code and Data for ACL 2019 "Semantically Conditioned Dialog Response Generation via Hierarchical Disentangled Self-Attention"☆137Oct 7, 2019Updated 6 years ago
- Code, Models and Datasets for OpenViDial Dataset☆132Jan 22, 2022Updated 4 years ago
- Data of ACL 2019 Paper "Expressing Visual Relationships via Language".☆62Sep 30, 2020Updated 5 years ago
- ☆54Nov 18, 2019Updated 6 years ago
- ☆27May 4, 2020Updated 5 years ago
- Visual Coreference Resolution in Visual Dialog using Neural Module Networks☆57Oct 12, 2021Updated 4 years ago
- [ACL'19] [PyTorch] Multimodal Transformer☆958Sep 12, 2022Updated 3 years ago
- ☆10Jun 11, 2019Updated 6 years ago
- A tensorflow implementation of VHRED(A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues)☆17Mar 24, 2019Updated 6 years ago
- Code for ''A Simple Baseline for Audio-Visual Scene-Aware Dialog``☆27May 26, 2020Updated 5 years ago
- ☆77Nov 22, 2022Updated 3 years ago
- [ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering☆132Oct 25, 2022Updated 3 years ago
- ☆44Jun 16, 2025Updated 7 months ago
- A novel method of constrained decoding for neural NLG (NNLG) models☆84Jul 13, 2020Updated 5 years ago
- With the aim of building next generation virtual assistants that can handle multimodal inputs and perform multimodal actions, we introduc…☆133Oct 21, 2023Updated 2 years ago
- Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"☆187Apr 15, 2021Updated 4 years ago
- PyTorch code for Large-Scale Answerer in Questioner's Mind for Visual Dialog Question Generation (AQM+) (ICLR 2019)☆51Feb 12, 2019Updated 7 years ago
- Source code and data for the paper "Towards String-to-Tree Neural Machine Translation"☆16Dec 31, 2017Updated 8 years ago
- Code for the article "Automatic Temperature Control for Neural Machine Translation" (EMNLP 2018)☆14Apr 16, 2019Updated 6 years ago
- Code on IART: Intent-aware Response Ranking with Transformers in Information-seeking Conversation Systems (WWW 2020)☆11Apr 18, 2021Updated 4 years ago
- [CVPR 2019] Pytorch code for Audio Visual Scene-Aware Dialog☆34Feb 1, 2021Updated 5 years ago
- Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…☆34May 14, 2020Updated 5 years ago
- [ACL 2019] Visually Grounded Neural Syntax Acquisition☆90Feb 24, 2024Updated last year
- Codes for "Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View"☆147Jun 10, 2019Updated 6 years ago
- Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…☆59Mar 24, 2023Updated 2 years ago
- Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)☆469May 6, 2021Updated 4 years ago
- Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)☆27Aug 19, 2022Updated 3 years ago
- This repository contains the Pytorch implementation for our SCAI (EMNLP-2018) submission "A Knowledge-Grounded Multimodal Search-Based Co…☆30Jun 4, 2020Updated 5 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Nov 3, 2018Updated 7 years ago
- DeNSe parser in Dependency Parsing as Head Selection (EACL 2017) https://arxiv.org/abs/1606.01280☆25Apr 27, 2017Updated 8 years ago