Re-implementation for 'R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering'.
☆12Mar 13, 2026Updated last week
Alternatives and similar repositories for relation-vqa
Users who are interested in relation-vqa are comparing it to the libraries listed below
- R-VQA: Visual Question Answering with Relation Facts☆19May 11, 2021Updated 4 years ago
- Good practices in VQA systems, such as POS-tag attention, structured triplet learning, and triplet attention, are very general and can be…☆19Jan 23, 2018Updated 8 years ago
- Code for our IJCAI2020 paper: Overcoming Language Priors with Self-supervised Learning for Visual Question Answering☆52Aug 21, 2020Updated 5 years ago
- ☆14May 10, 2021Updated 4 years ago
- Rich Visual Knowledge-based Augmentation Network for Visual Question Answering☆10Dec 6, 2019Updated 6 years ago
- A PyTorch implementation of a data augmentation method for visual question answering☆21May 25, 2023Updated 2 years ago
- Project for Dynamic Capsule Attention☆12Dec 7, 2019Updated 6 years ago
- VQA baseline with Conditional Batch Normalization☆15Apr 9, 2018Updated 7 years ago
- Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"☆187Apr 15, 2021Updated 4 years ago
- Code for our paper: Learning Conditioned Graph Structures for Interpretable Visual Question Answering☆150Mar 11, 2019Updated 7 years ago
- Code for NeurIPS 2019 paper "Self-Critical Reasoning for Robust Visual Question Answering"☆41Sep 9, 2019Updated 6 years ago
- Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering☆18Oct 30, 2019Updated 6 years ago
- Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021☆19Jul 27, 2021Updated 4 years ago
- ☆13Feb 11, 2021Updated 5 years ago
- The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"☆27May 6, 2021Updated 4 years ago
- ☆77Nov 22, 2022Updated 3 years ago
- ☆27Oct 7, 2021Updated 4 years ago
- MUREL (CVPR 2019), a multimodal relational reasoning module for VQA☆195Feb 9, 2020Updated 6 years ago
- Structured Attentions for Visual Question Answering☆46Mar 4, 2018Updated 8 years ago
- [Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph☆72Feb 9, 2024Updated 2 years ago
- Torch code for Visual Question Generation☆14Mar 30, 2019Updated 6 years ago
- Deep Modular Co-Attention Networks for Visual Question Answering☆458Dec 16, 2020Updated 5 years ago
- Implementation of Mutan+ArticleNet on OKVQA☆10Jan 11, 2021Updated 5 years ago
- Implementation of GAT with batching☆10Nov 28, 2020Updated 5 years ago
- BLOCK (AAAI 2019), with a multimodal fusion library for deep learning models☆356Dec 4, 2019Updated 6 years ago
- A PyTorch implementation of the CVPR 2020 paper: Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text☆51May 22, 2023Updated 2 years ago
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆51Jan 30, 2026Updated last month
- BERT-family models, search, pruning, distillation☆13Sep 10, 2020Updated 5 years ago
- Pytorch 0.41 implementation of the U-Net for image semantic segmentation + Dataloader for ISBI 2012 Challenge☆14Jul 15, 2020Updated 5 years ago
- Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning, in ICCV 2019☆92Aug 9, 2019Updated 6 years ago
- Visual Question Answering project with state-of-the-art single-model performance.☆131Jun 18, 2018Updated 7 years ago
- 12-in-1: Multi-Task Vision and Language Representation Learning Web Demo☆35Dec 8, 2022Updated 3 years ago
- The official implementation of "Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings"☆18Dec 5, 2024Updated last year
- tensorflow in depth☆14Jul 25, 2018Updated 7 years ago
- Event based Sign-Language-Translation☆19Feb 27, 2026Updated 3 weeks ago
- Decoding h264 rtsp stream with libavformat☆15Nov 19, 2015Updated 10 years ago
- This repository contains code and models for the paper: Semantic Graphs for Generating Deep Questions (ACL 2020).☆65Jan 20, 2024Updated 2 years ago
- Dynamic Spear Model☆12Jul 24, 2019Updated 6 years ago
- ☆12Dec 20, 2024Updated last year