alibabadoufu/dynamic_fusion_reimplementation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alibabadoufu/dynamic_fusion_reimplementation)

alibabadoufu / dynamic_fusion_reimplementation

Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering

☆17

Alternatives and similar repositories for dynamic_fusion_reimplementation

Users that are interested in dynamic_fusion_reimplementation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yuleiniu / introd
View on GitHub
[NeurIPS 2021] Introspective Distillation for Robust Question Answering
☆13Dec 7, 2021Updated 4 years ago
prdwb / okvqa-release
View on GitHub
☆15May 10, 2021Updated 5 years ago
sergiotasconmorales / consistency_vqa
View on GitHub
Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022)
☆26Mar 28, 2023Updated 3 years ago
PhoebusSi / SAR
View on GitHub
Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment"
☆31Nov 24, 2021Updated 4 years ago
XMUVQA / CapsAtt
View on GitHub
Project for Dynamic Capsule Attention
☆12Dec 7, 2019Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
BierOne / relation-vqa
View on GitHub
Re-implementation for 'R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering'.
☆12Mar 13, 2026Updated 4 months ago
erobic / negative_analysis_of_grounding
View on GitHub
Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)
☆23Jun 26, 2020Updated 6 years ago
SwordfallYeung / LogMonitor
View on GitHub
利用kafka+storm+mysql/redis构建日志监控系统
☆13May 6, 2018Updated 8 years ago
GaryGuTC / LaPA_model
View on GitHub
[CVPRW 2024] LaPA: Latent Prompt Assist Model For Medical Visual Question Answering
☆27Apr 24, 2025Updated last year
jialinwu17 / self_critical_vqa
View on GitHub
Code for NeurIPS 2019 paper ``Self-Critical Reasoning for Robust Visual Question Answering''
☆40Sep 9, 2019Updated 6 years ago
aioz-ai / MICCAI19-MedVQA
View on GitHub
AIOZ AI - Overcoming Data Limitation in Medical Visual Question Answering (MICCAI 2019)
☆70Apr 21, 2026Updated 3 months ago
Event-AHU / EFV_event_classification
View on GitHub
[PRCV-2023, IEEE TMM-2025] Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion based Classification
☆12Dec 20, 2025Updated 7 months ago
HLR / Cross_Modality_Relevance
View on GitHub
The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"
☆27May 6, 2021Updated 5 years ago
rajatkoner08 / rtn
View on GitHub
This is a code repository for Relation Transformer Network
☆13Nov 30, 2021Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
siahuat0727 / MGNet
View on GitHub
The official implementation of "Multi-Glimpse Network: A Robust and Efficient Classification Architecture based on Recurrent Downsampled …
☆13Nov 4, 2021Updated 4 years ago
ExplainableML / CLEVR-X
View on GitHub
CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations
☆30Oct 27, 2023Updated 2 years ago
haifangong / CMSA-MTPT-4-MedicalVQA
View on GitHub
[ICMR'21, Best Poster Paper Award] Medical Visual Question Answering with Multi-task Pre-training and Cross-modal Self-attention
☆34Dec 15, 2022Updated 3 years ago
VirajBagal / MMBERT
View on GitHub
MMBERT: Multimodal BERT Pretraining for Improved Medical VQA
☆39Mar 22, 2021Updated 5 years ago
phellonchen / DMRM
View on GitHub
DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog
☆25Mar 8, 2022Updated 4 years ago
yjybuaa / RGBDAerialTracking
View on GitHub
☆10May 23, 2023Updated 3 years ago
GeraldHan / GGE
View on GitHub
Code for Greedy Gradient Ensemble for Visual Question Answering （ICCV 2021, Oral）
☆27Mar 28, 2022Updated 4 years ago
CogComp / Salient-Event-Detection
View on GitHub
The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"
☆10Jul 5, 2022Updated 4 years ago
Awenbocc / CPCR
View on GitHub
☆15Mar 11, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
XianhuiChen / Bottleneck-Attention-Based-Fusion-Network-for-Sleep-Apnea-Detection
View on GitHub
☆28Aug 22, 2024Updated last year
CCYChongyanChen / VQA_AlgorithmDatasets
View on GitHub
☆37Jan 20, 2023Updated 3 years ago
MILVLG / mcan-vqa
View on GitHub
Deep Modular Co-Attention Networks for Visual Question Answering
☆459Dec 16, 2020Updated 5 years ago
kaiyinzhou / Relation_Extration
View on GitHub
Relation Classification via Convolutional Deep Neural Network
☆13Nov 9, 2018Updated 7 years ago
VSJMilewski / multimodal-probes
View on GitHub
Code base for paper "Finding Structural Knowledge in Multimodal-BERT". Framework for probing and code for creating Scene Trees.
☆10May 19, 2022Updated 4 years ago
yikang-li / PasteGAN
View on GitHub
An pytorch implementation of our NeurIPS paper of PasteGAN: A Semi-Parametric Method to Generate Image from Scene Graph
☆54Nov 22, 2022Updated 3 years ago
robintzeng / mask-rcnn-Pytorch
View on GitHub
Modification of the original Mask/Faster R-CNN
☆12Dec 13, 2020Updated 5 years ago
Adam1679 / mutan-article-net
View on GitHub
Implementation of Mutan+ArticleNet on OKVQA
☆10Jan 11, 2021Updated 5 years ago
UCSB-AI / CPL
View on GitHub
Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"
☆35Dec 5, 2022Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
fredfung007 / snlt
View on GitHub
☆15Dec 3, 2021Updated 4 years ago
Wangt-CN / VQG-GCN
View on GitHub
A GCN based visual question generation model
☆13Aug 21, 2019Updated 6 years ago
ShilongBoy / SpringBootKafka
View on GitHub
Strom 实时风控统计
☆21Nov 30, 2017Updated 8 years ago
jnhwkim / ban-vqa
View on GitHub
Bilinear attention networks for visual question answering
☆548Oct 30, 2023Updated 2 years ago
szzexpoi / AiR
View on GitHub
Official Repository for ECCV 2020 paper "AiR: Attention with Reasoning Capability"
☆54Jun 29, 2021Updated 5 years ago
linjieli222 / VQA_ReGAT
View on GitHub
Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"
☆187Apr 15, 2021Updated 5 years ago
2snoopy88 / GAT-with-batch
View on GitHub
implement gat with batch
☆10Nov 28, 2020Updated 5 years ago