wangpengnorman/FVQA

☆22

Alternatives and similar repositories for FVQA

Users who are interested in FVQA are comparing it to the repositories listed below.

cs-jerhuang / P-VQA
Medical Knowledge-Based Network For Patient-oriented Visual Question Answering
☆19 · Feb 25, 2023 · Updated 3 years ago
China-UK-ZSL / ZS-F-VQA
[Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph
☆72 · Feb 9, 2024 · Updated 2 years ago
jlian2 / mucko
PyTorch Implementation of MUCKO (2020 IJCAI)
☆20 · Oct 25, 2020 · Updated 5 years ago
jialinwu17 / MAVEX
☆30 · Dec 16, 2022 · Updated 3 years ago
AndersonStra / Mucko
Implementation of Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering
☆10 · Mar 17, 2022 · Updated 4 years ago
Adam1679 / mutan-article-net
Implementation of Mutan+ArticleNet on OKVQA
☆10 · Jan 11, 2021 · Updated 5 years ago
prdwb / okvqa-release
☆15 · May 10, 2021 · Updated 5 years ago
AndersonStra / MuKEA
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering
☆101 · Mar 30, 2023 · Updated 3 years ago
alirezasalemi7 / DEDR-MM-FiD
Code for the paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering
☆14 · Aug 22, 2023 · Updated 2 years ago
ThalesGroup / ConceptBERT
Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering
☆31 · Apr 30, 2024 · Updated 2 years ago
leimiaomiao / Multi-Objective-Workflow-Scheduling
Implementation of genetic algorithm and MOHEFT algorithm for scheduling workflow in mobile cloud.
☆19 · Feb 22, 2017 · Updated 9 years ago
zhegan27 / LXMERT-AdvTrain
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…
☆21 · Oct 20, 2020 · Updated 5 years ago
BierOne / bottom-up-attention-vqa
An updated PyTorch implementation of hengyuan-hu's version for 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question…
☆34 · Mar 13, 2026 · Updated 4 months ago
MILVLG / rosita
ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration
☆57 · Jun 13, 2023 · Updated 3 years ago
SpencerWhitehead / novelvqa
☆27 · Oct 7, 2021 · Updated 4 years ago
luomancs / retriever_reader_for_okvqa
☆19 · Dec 8, 2022 · Updated 3 years ago
HanqingWangAI / Active_VLN
The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`
☆44 · Apr 9, 2022 · Updated 4 years ago
YuJungHeo / kbvqa-public
☆40 · Nov 29, 2022 · Updated 3 years ago
wangmengsd / richpedia
Richpedia: A Comprehensive Multi-Modal Knowledge Graph
☆53 · Apr 18, 2019 · Updated 7 years ago
ZhaozwTD / KPE
Code for the ACL 2023 paper: Knowledgeable Parameter Efficient Tuning Network for Commonsense Question Answering.
☆11 · Sep 23, 2023 · Updated 2 years ago
Zehong-Ma / OVMR
OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)
☆36 · Jun 16, 2025 · Updated last year
3dlg-hcvc / LAW-VLNCE
Language-Aligned Waypoint (LAW) Supervision for Vision-and-Language Navigation in Continuous Environments
☆13 · Nov 29, 2021 · Updated 4 years ago
jingchenchen / ReasoningConsistency-VQA
☆13 · Aug 14, 2022 · Updated 3 years ago
cwj1412 / MSCOCO-Flikcr30K_FG
Benchmark data for "Rethinking Benchmarks for Cross-modal Image-text Retrieval" (SIGIR 2023)
☆28 · Apr 24, 2023 · Updated 3 years ago
NeverMoreLCH / Awesome-VQA
A reading list of papers about Visual Question Answering.
☆35 · Aug 17, 2022 · Updated 3 years ago
sail-sg / VGT
Video Graph Transformer for Video Question Answering (ECCV'22)
☆49 · Jun 8, 2023 · Updated 3 years ago
sanket0211 / WK-VQA
World Knowledge Based Visual Question Answering
☆22 · Nov 26, 2020 · Updated 5 years ago
Atmegal / MFURLN-CVPR-2019-relationship-detection-method
MFURLN relationship detection method
☆21 · May 17, 2020 · Updated 6 years ago
UKPLab / emnlp2020-multicqa
MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale
☆14 · Mar 22, 2021 · Updated 5 years ago
ChengHan111 / VPT-or-FT
Official PyTorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?' (ICLR 2024)
☆13 · Mar 8, 2024 · Updated 2 years ago
aioz-ai / MICCAI21_MMQ
Multiple Meta-model Quantifying for Medical Visual Question Answering (MICCAI 2021)
☆37 · Apr 21, 2026 · Updated 3 months ago
PKU-ICST-MIPL / MKVSE-TOMM2023
☆28 · May 16, 2023 · Updated 3 years ago
expectorlin / DR-Attacker
Code for the paper "Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation" (TPAMI 2021)
☆10 · Jul 15, 2022 · Updated 4 years ago
Lizw14 / CaliCO
Code for ICCV2021 paper: Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images
☆15 · Jan 24, 2023 · Updated 3 years ago
Shivanshu-Gupta / Visual-Question-Answering
CNN+LSTM, Attention based, and MUTAN-based models for Visual Question Answering
☆78 · Jan 19, 2020 · Updated 6 years ago
vuhoangminh / vqa_medical
☆10 · Oct 20, 2022 · Updated 3 years ago
amazon-science / indoor-scene-generation-eai
☆61 · Jul 25, 2023 · Updated 3 years ago
arjunmajum / vln-bert
Code for the paper "Improving Vision-and-Language Navigation with Image-Text Pairs from the Web" (ECCV 2020)
☆59 · Oct 7, 2022 · Updated 3 years ago
lisun-ai / DocAgent
Official Python implementation for DocAgent, accepted to EMNLP 2025
☆19 · Nov 4, 2025 · Updated 8 months ago