lupantech / dual-mfa-vqaView external linksLinks
Co-attending Regions and Detections for VQA.
☆40Jun 2, 2018Updated 7 years ago
Alternatives and similar repositories for dual-mfa-vqa
Users that are interested in dual-mfa-vqa are comparing it to the libraries listed below
Sorting:
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Jan 23, 2018Updated 8 years ago
- Code for ACL 2018 paper 'Think Visually: Question Answering through Virtual Imagery'☆13Mar 24, 2023Updated 2 years ago
- This released code is for our ACL2018 paper "End-Task Oriented Textual Entailment via Deep Explorations of Inter-Sentence Interactions". …☆15May 28, 2018Updated 7 years ago
- ☆351Oct 2, 2018Updated 7 years ago
- multi-label classification on DBLP dataset with DeepWalk algorithm to extract latent dimension☆13Jan 17, 2015Updated 11 years ago
- Implementation for the paper "Dynamic Language Binding in Relational Visual Reasoning" (Le et al., IJCAI 2020)☆13Jul 25, 2024Updated last year
- Tracking by Joint Local and Global Search: A Target-aware Attention based Approach (IEEE TNNLS 2021)☆10Oct 26, 2021Updated 4 years ago
- Re-implementation: Ask Me Anything: Dynamic Memory Networks for Natural Language Processing☆14Apr 7, 2019Updated 6 years ago
- The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch☆16Apr 22, 2019Updated 6 years ago
- Pytorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671☆18May 6, 2021Updated 4 years ago
- implement n2nmn with pytorch☆19Apr 10, 2019Updated 6 years ago
- PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations☆42Jun 30, 2021Updated 4 years ago
- Arbitrary Style Transfer for Videos with Adaptive Instance Normalization https://arxiv.org/abs/1703.06868☆17Apr 5, 2017Updated 8 years ago
- Random memory adaptation model inspired by the paper: "Memory-based parameter adaptation (MbPA)"☆24Mar 13, 2018Updated 7 years ago
- Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)☆238Apr 16, 2018Updated 7 years ago
- Structured Attentions for Visual Question Answering☆46Mar 4, 2018Updated 7 years ago
- Source code of Knowledge Enhanced Hybrid Neural Network for Text Matching☆17Aug 15, 2018Updated 7 years ago
- Code for NeurIPS 2019 paper ``Self-Critical Reasoning for Robust Visual Question Answering''☆41Sep 9, 2019Updated 6 years ago
- This repository contains the tensorflow implementation and models for DAN - CVPR 2017 paper☆22Jul 13, 2018Updated 7 years ago
- ☆183Jul 30, 2019Updated 6 years ago
- ☆218Aug 13, 2016Updated 9 years ago
- Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering☆25Nov 4, 2020Updated 5 years ago
- Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning. in ICCV, 2019☆92Aug 9, 2019Updated 6 years ago
- caption images w/ visual attn☆19May 13, 2016Updated 9 years ago
- Pytorch implementation of "Dynamic Coattention Networks For Question Answering"☆62Oct 21, 2018Updated 7 years ago
- Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering☆107Oct 14, 2019Updated 6 years ago
- Code for NIPS 2018 paper, "Chain of Reasoning for Visual Question Answering"☆28Nov 23, 2018Updated 7 years ago
- Caffe implementation of paper: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering"☆29Oct 24, 2018Updated 7 years ago
- Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19☆33Jul 1, 2019Updated 6 years ago
- Deep Point Process by PyTorch☆28Dec 9, 2020Updated 5 years ago
- Code for the paper "Ask Me Any Rating: A Content-based Recommender System based on Recurrent Neural Networks"☆30Sep 5, 2017Updated 8 years ago
- Tensorflow implementation of Attention-over-Attention Neural Networks for Reading Comprehension☆28Sep 25, 2016Updated 9 years ago
- PyTorch implementation of L-GCN [https://arxiv.org/abs/2008.09105]☆25Apr 25, 2021Updated 4 years ago
- Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments"☆126Feb 11, 2020Updated 6 years ago
- ☆38Mar 30, 2021Updated 4 years ago
- Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)☆134Jul 25, 2024Updated last year
- MUREL (CVPR 2019), a multimodal relational reasoning module for VQA☆195Feb 9, 2020Updated 6 years ago
- Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Los…☆30Jun 29, 2020Updated 5 years ago
- ACM ICMR 2019《Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention》☆36Jun 19, 2019Updated 6 years ago