cvlab-tohoku/Dense-CoAttention-Network

Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering

☆107

Alternatives and similar repositories for Dense-CoAttention-Network

Users who are interested in Dense-CoAttention-Network are comparing it to the repositories listed below.

jnhwkim / ban-vqa
Bilinear attention networks for visual question answering
☆549 · Oct 30, 2023 · Updated 2 years ago
MILVLG / mcan-vqa
Deep Modular Co-Attention Networks for Visual Question Answering
☆459 · Dec 16, 2020 · Updated 5 years ago
hengyuan-hu / bottom-up-attention-vqa
An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.
☆768 · Mar 10, 2024 · Updated 2 years ago
aimbrain / vqa-project
Code for our paper: Learning Conditioned Graph Structures for Interpretable Visual Question Answering
☆150 · Mar 11, 2019 · Updated 7 years ago
yuzcccc / vqa-mfb
☆184 · Jul 30, 2019 · Updated 6 years ago
lmelvix / visual-question-answering-tensorflow
Stacked attention network for answering open-ended questions about image
☆12 · May 31, 2018 · Updated 8 years ago
AmingWu / CCN
Connective Cognition Network for Directional Visual Commonsense Reasoning
☆15 · May 6, 2021 · Updated 5 years ago
KaihuaTang / VQA2.0-Recent-Approachs-2018.pytorch
A PyTorch reimplementation of "Bilinear Attention Network", "Intra- and Inter-modality Attention", "Learning Conditioned Graph Structures…
☆300 · Jan 6, 2026 · Updated 6 months ago
MILVLG / openvqa
A lightweight, scalable, and general framework for visual question answering research
☆333 · Sep 3, 2021 · Updated 4 years ago
karunraju / VQA
Hierarchical Question-Image Co-Attention for Visual Question Answering
☆24 · Jun 2, 2019 · Updated 7 years ago
jiasenlu / HieCoAttenVQA
☆351 · Oct 2, 2018 · Updated 7 years ago
shtechair / vqa-sva
Structured Attentions for Visual Question Answering
☆46 · Mar 4, 2018 · Updated 8 years ago
Cyanogenoid / pytorch-vqa
Strong baseline for visual question answering
☆240 · Mar 13, 2023 · Updated 3 years ago
jiasenlu / visDial.pytorch
Visual dialog model in PyTorch
☆110 · May 16, 2018 · Updated 8 years ago
lixiangpengcs / PSAC
Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering
☆27 · Apr 15, 2021 · Updated 5 years ago
zhaoluffy / hLSTMat
The paper of "Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning" accepted in International Joint Conference on Arti…
☆16 · Jun 29, 2017 · Updated 9 years ago
TingAnChien / san-vqa-tensorflow
☆20 · May 6, 2019 · Updated 7 years ago
facebookresearch / corefnmn
Visual Coreference Resolution in Visual Dialog using Neural Module Networks
☆58 · Oct 12, 2021 · Updated 4 years ago
prdwb / okvqa-release
☆15 · May 10, 2021 · Updated 5 years ago
lupantech / dual-mfa-vqa
Co-attending Regions and Detections for VQA.
☆40 · Jun 2, 2018 · Updated 8 years ago
Cadene / murel.bootstrap.pytorch
MUREL (CVPR 2019), a multimodal relational reasoning module for VQA
☆194 · Feb 9, 2020 · Updated 6 years ago
arya46 / VQA-Flask-App
A simple Flask app to generate answer given an image and a natural language question about the image. The app uses a deep learning model,…
☆12 · Nov 21, 2022 · Updated 3 years ago
xh-liu / CM-Erase-REG
Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"
☆34 · Jul 29, 2019 · Updated 6 years ago
SpencerWhitehead / novelvqa
☆27 · Oct 7, 2021 · Updated 4 years ago
Cadene / vqa.pytorch
Visual Question Answering in PyTorch
☆733 · Dec 11, 2019 · Updated 6 years ago
JunweiLiang / FVTA_MemexQA
Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19
☆33 · Jul 1, 2019 · Updated 7 years ago
sergulaydore / Feature-Grouping-Regularizer
Code for the paper "Feature Grouping as a Stochastic Regularizer for High-Dimensional Structured Data" at ICML 2019.
☆20 · Apr 22, 2019 · Updated 7 years ago
JunweiLiang / DualAttentionNetwork
This repository contains the TensorFlow implementation and models for DAN - CVPR 2017 paper
☆22 · Jul 13, 2018 · Updated 8 years ago
peteanderson80 / bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
☆1,471 · Feb 3, 2023 · Updated 3 years ago
lucasjinreal / RetinaNet
PyTorch implementation of RetinaNet with a CUDA-accelerated NMS operation.
☆10 · Jul 8, 2019 · Updated 7 years ago
Cadene / block.bootstrap.pytorch
BLOCK (AAAI 2019), with a multimodal fusion library for deep learning models
☆354 · Dec 4, 2019 · Updated 6 years ago
lixiangpengcs / Spatial-Temporal-Adaptive-Attention-for-Video-Captioning
Extension of hLSTMat
☆19 · Apr 15, 2021 · Updated 5 years ago
ceyzaguirre4 / DACT-MAC
Repository for hosting the code for the CVPR 2020 paper Differentiable Adaptive Computation Time for Visual Reasoning.
☆14 · Aug 26, 2020 · Updated 5 years ago
zaynmi / seada-vqa
A PyTorch implementation of a data augmentation method for visual question answering
☆21 · May 25, 2023 · Updated 3 years ago
zfchenUnique / Cops-Ref
Accepted by CVPR 2020.
☆27 · Jul 11, 2024 · Updated 2 years ago
wangzheallen / STL-VQA
The good practice in the VQA system such as pos-tag attention, structured triplet learning and triplet attention is very general and can be…
☆19 · Jan 23, 2018 · Updated 8 years ago
casperhansen / fake-news-reasoning
Automatic Fake News Detection: Are Models Learning to Reason. ACL 2021
☆16 · May 17, 2021 · Updated 5 years ago
jialinwu17 / self_critical_vqa
Code for NeurIPS 2019 paper "Self-Critical Reasoning for Robust Visual Question Answering"
☆40 · Sep 9, 2019 · Updated 6 years ago
atulkum / co-attention
PyTorch implementation of "Dynamic Coattention Networks For Question Answering"
☆62 · Oct 21, 2018 · Updated 7 years ago