Co-attending Regions and Detections for VQA.
☆40Jun 2, 2018Updated 7 years ago
Alternatives and similar repositories for dual-mfa-vqa
Users that are interested in dual-mfa-vqa are comparing it to the libraries listed below
Sorting:
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Jan 23, 2018Updated 8 years ago
- Code for ACL 2018 paper 'Think Visually: Question Answering through Virtual Imagery'☆13Mar 24, 2023Updated 2 years ago
- This released code is for our ACL2018 paper "End-Task Oriented Textual Entailment via Deep Explorations of Inter-Sentence Interactions". …☆15May 28, 2018Updated 7 years ago
- ☆351Oct 2, 2018Updated 7 years ago
- Tracking by Joint Local and Global Search: A Target-aware Attention based Approach (IEEE TNNLS 2021)☆10Oct 26, 2021Updated 4 years ago
- Codes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language"☆16Oct 22, 2022Updated 3 years ago
- Implementation for the paper "Dynamic Language Binding in Relational Visual Reasoning" (Le et al., IJCAI 2020)☆13Jul 25, 2024Updated last year
- multi-label classification on DBLP dataset with DeepWalk algorithm to extract latent dimension☆13Jan 17, 2015Updated 11 years ago
- Re-implementation: Ask Me Anything: Dynamic Memory Networks for Natural Language Processing☆14Apr 7, 2019Updated 6 years ago
- PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations☆42Jun 30, 2021Updated 4 years ago
- Arbitrary Style Transfer for Videos with Adaptive Instance Normalization https://arxiv.org/abs/1703.06868☆17Apr 5, 2017Updated 8 years ago
- Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)☆238Apr 16, 2018Updated 7 years ago
- Structured Attentions for Visual Question Answering☆46Mar 4, 2018Updated 8 years ago
- Source code of Knowledge Enhanced Hybrid Neural Network for Text Matching☆17Aug 15, 2018Updated 7 years ago
- Code for NeurIPS 2019 paper ``Self-Critical Reasoning for Robust Visual Question Answering''☆41Sep 9, 2019Updated 6 years ago
- This repository contains the tensorflow implementation and models for DAN - CVPR 2017 paper☆22Jul 13, 2018Updated 7 years ago
- ☆183Jul 30, 2019Updated 6 years ago
- ☆218Aug 13, 2016Updated 9 years ago
- Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning. in ICCV, 2019☆92Aug 9, 2019Updated 6 years ago
- caption images w/ visual attn☆19May 13, 2016Updated 9 years ago
- Pytorch implementation of "Dynamic Coattention Networks For Question Answering"☆62Oct 21, 2018Updated 7 years ago
- Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering☆107Oct 14, 2019Updated 6 years ago
- Code for NIPS 2018 paper, "Chain of Reasoning for Visual Question Answering"☆28Nov 23, 2018Updated 7 years ago
- Code for the Grounded Visual Question Answering (GVQA) model from the paper -- Don't Just Assume; Look and Answer: Overcoming Priors for …☆27Mar 10, 2022Updated 3 years ago
- Deep Point Process by PyTorch☆28Dec 9, 2020Updated 5 years ago
- Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19☆33Jul 1, 2019Updated 6 years ago
- Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments"☆126Feb 11, 2020Updated 6 years ago
- PyTorch implementation of L-GCN [https://arxiv.org/abs/2008.09105]☆25Apr 25, 2021Updated 4 years ago
- ☆38Mar 30, 2021Updated 4 years ago
- Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)☆134Jul 25, 2024Updated last year
- MUREL (CVPR 2019), a multimodal relational reasoning module for VQA☆195Feb 9, 2020Updated 6 years ago
- TensorFlow implementation of "Learning to Rank Question-Answer Pairs using Hierarchical Recurrent Encoder with Latent Topic Clustering," …☆31Feb 18, 2021Updated 5 years ago
- Code for "ROAM: Recurrently Optimizing Tracking Model [CVPR2020]"☆36Nov 1, 2020Updated 5 years ago
- ACM ICMR 2019《Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention》☆36Jun 19, 2019Updated 6 years ago
- Some ROS code examples which hopefully are robot-agnostic.☆12May 16, 2018Updated 7 years ago
- Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)☆34Jul 17, 2019Updated 6 years ago
- ☆10Dec 16, 2023Updated 2 years ago
- ☆10Apr 30, 2024Updated last year
- Matlab code for learning doubly sparse dictionary on synthetic data. Details can be found in the paper "A Provable Approach for Double-Sp…☆11Mar 5, 2018Updated 8 years ago