Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering
☆18Oct 30, 2019Updated 6 years ago
Alternatives and similar repositories for dynamic_fusion_reimplementation
Users that are interested in dynamic_fusion_reimplementation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Jan 23, 2018Updated 8 years ago
- [NeurIPS 2021] Introspective Distillation for Robust Question Answering☆13Dec 7, 2021Updated 4 years ago
- ☆15May 10, 2021Updated 4 years ago
- Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022)☆26Mar 28, 2023Updated 3 years ago
- Project for Dynamic Capsule Attention☆12Dec 7, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Re-implementation for 'R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering'.☆12Mar 13, 2026Updated last month
- Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)☆23Jun 26, 2020Updated 5 years ago
- [CVPRW 2024] LaPA: Latent Prompt Assist Model For Medical Visual Question Answering☆26Apr 24, 2025Updated last year
- vqa drived by bottom-up and top-down attention and knowledge☆14Nov 21, 2018Updated 7 years ago
- 利用kafka+storm+mysql/redis构建日志监控系统☆13May 6, 2018Updated 8 years ago
- [PRCV-2023, IEEE TMM-2025] Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion based Classification☆12Dec 20, 2025Updated 4 months ago
- The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"☆27May 6, 2021Updated 5 years ago
- CNN+LSTM, Attention based, and MUTAN-based models for Visual Question Answering☆78Jan 19, 2020Updated 6 years ago
- This is a code repository for Relation Transformer Network☆13Nov 30, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆27Oct 7, 2021Updated 4 years ago
- CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations☆30Oct 27, 2023Updated 2 years ago
- The official implementation of "Multi-Glimpse Network: A Robust and Efficient Classification Architecture based on Recurrent Downsampled …☆13Nov 4, 2021Updated 4 years ago
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆15Jul 31, 2025Updated 9 months ago
- converting the pretrained tensorflow SoundNet model to pytorch☆14Jun 15, 2022Updated 3 years ago
- DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog☆25Mar 8, 2022Updated 4 years ago
- ☆10May 23, 2023Updated 2 years ago
- Code for Greedy Gradient Ensemble for Visual Question Answering (ICCV 2021, Oral)☆27Mar 28, 2022Updated 4 years ago
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"☆11Nov 18, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"☆10Jul 5, 2022Updated 3 years ago
- ☆15Mar 11, 2023Updated 3 years ago
- ☆28Aug 22, 2024Updated last year
- ☆37Jan 20, 2023Updated 3 years ago
- Deep Modular Co-Attention Networks for Visual Question Answering☆459Dec 16, 2020Updated 5 years ago
- Implementation of Mutan+ArticleNet on OKVQA☆10Jan 11, 2021Updated 5 years ago
- Code base for paper "Finding Structural Knowledge in Multimodal-BERT". Framework for probing and code for creating Scene Trees.☆10May 19, 2022Updated 3 years ago
- Modification of the original Mask/Faster R-CNN☆12Dec 13, 2020Updated 5 years ago
- implement gat with batch☆10Nov 28, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A GCN based visual question generation model☆13Aug 21, 2019Updated 6 years ago
- Bilinear attention networks for visual question answering☆547Oct 30, 2023Updated 2 years ago
- This is the second version of the practices for the rookies of BJTUNLPers.☆18Jan 13, 2021Updated 5 years ago
- 对论文Neural Relation Extraction with Selective Attention over Instances的改进☆12Aug 17, 2018Updated 7 years ago
- Fine-Tuning of DeepSeek-Style Reasoning Models | RL + Quantization Implementation☆17Feb 9, 2025Updated last year
- 🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"☆13Feb 1, 2023Updated 3 years ago
- 12-in-1: Multi-Task Vision and Language Representation Learning Web Demo☆35Dec 8, 2022Updated 3 years ago