cvlab-tohoku / Dense-CoAttention-NetworkView external linksLinks
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
☆107Oct 14, 2019Updated 6 years ago
Alternatives and similar repositories for Dense-CoAttention-Network
Users that are interested in Dense-CoAttention-Network are comparing it to the libraries listed below
Sorting:
- Deep Modular Co-Attention Networks for Visual Question Answering☆458Dec 16, 2020Updated 5 years ago
- Code for our paper: Learning Conditioned Graph Structures for Interpretable Visual Question Answering☆150Mar 11, 2019Updated 6 years ago
- An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.☆765Mar 10, 2024Updated last year
- A pytroch reimplementation of "Bilinear Attention Network", "Intra- and Inter-modality Attention", "Learning Conditioned Graph Structures…☆297Jan 6, 2026Updated last month
- Strong baseline for visual question answering☆240Mar 13, 2023Updated 2 years ago
- Stacked attention network for answering open-ended questions about image☆12May 31, 2018Updated 7 years ago
- ☆351Oct 2, 2018Updated 7 years ago
- ☆183Jul 30, 2019Updated 6 years ago
- A lightweight, scalable, and general framework for visual question answering research☆330Sep 3, 2021Updated 4 years ago
- MUREL (CVPR 2019), a multimodal relational reasoning module for VQA☆195Feb 9, 2020Updated 6 years ago
- Connective Cognition Network for Directional Visual Commonsense Reasoning☆15May 6, 2021Updated 4 years ago
- Hierarchical Question-Image Co-Attention for Visual Question Answering☆24Jun 2, 2019Updated 6 years ago
- Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering☆27Apr 15, 2021Updated 4 years ago
- Structured Attentions for Visual Question Answering☆46Mar 4, 2018Updated 7 years ago
- Code for the paper "Feature Grouping as a Stochastic Regularizer for High-Dimensional Structured Data" at ICML 2019.☆20Apr 22, 2019Updated 6 years ago
- Code for ICLR2018 paper: Improving GAN Training via Binarized Representation Entropy (BRE) Regularization - Y. Cao · W Ding · Y.C. Lui · …☆20Jun 12, 2018Updated 7 years ago
- Visual Coreference Resolution in Visual Dialog using Neural Module Networks☆57Oct 12, 2021Updated 4 years ago
- Co-attending Regions and Detections for VQA.☆40Jun 2, 2018Updated 7 years ago
- Visual Question Answering in Pytorch☆734Dec 11, 2019Updated 6 years ago
- The paper of "Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning" accepted in International Joint Conference on Arti…☆16Jun 29, 2017Updated 8 years ago
- Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"☆34Jul 29, 2019Updated 6 years ago
- Pytorch implementation of "Dynamic Coattention Networks For Question Answering"☆62Oct 21, 2018Updated 7 years ago
- This repository contains the tensorflow implementation and models for DAN - CVPR 2017 paper☆22Jul 13, 2018Updated 7 years ago
- ☆20May 6, 2019Updated 6 years ago
- Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments"☆126Feb 11, 2020Updated 6 years ago
- Pytorch Implementation of RetinaNet with CUDA accelerate nms operation.☆10Jul 8, 2019Updated 6 years ago
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Jan 23, 2018Updated 8 years ago
- A Python implementation of a graph-based parser for Abstract Meaning Representation (AMR)☆11Feb 2, 2018Updated 8 years ago
- BLOCK (AAAI 2019), with a multimodal fusion library for deep learning models☆356Dec 4, 2019Updated 6 years ago
- Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering☆25Nov 4, 2020Updated 5 years ago
- ☆27Oct 7, 2021Updated 4 years ago
- Attention-based Visual Question Answering in Torch☆101Aug 13, 2017Updated 8 years ago
- Motion-conditional image animation for video editing☆20Dec 2, 2023Updated 2 years ago
- ☆12Aug 29, 2019Updated 6 years ago
- Code for NeurIPS 2019 paper ``Self-Critical Reasoning for Robust Visual Question Answering''☆41Sep 9, 2019Updated 6 years ago
- Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome☆1,465Feb 3, 2023Updated 3 years ago
- Pytorch implementation of bytenet from "Neural Machine Translation in Linear Time" paper☆46Dec 19, 2017Updated 8 years ago
- 4th International Workshop on Event-based Vision, CVPR 2023 https://tub-rip.github.io/eventvision2023/☆16Jun 21, 2025Updated 7 months ago
- visual dialog model in pytorch☆110May 16, 2018Updated 7 years ago