aioz-ai/CFR_VQA

aioz-ai / CFR_VQA

Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)

☆48

Alternatives and similar repositories for CFR_VQA

Users who are interested in CFR_VQA are comparing it to the libraries listed below.

AndersonStra / MuKEA
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering
☆101 · Mar 30, 2023 · Updated 3 years ago
val-iisc / RMLVQA
☆19 · May 31, 2023 · Updated 3 years ago
jialinwu17 / MAVEX
☆30 · Dec 16, 2022 · Updated 3 years ago
guoyang9 / UnifER
Official implementation for the MM'22 paper.
☆14 · Jun 30, 2022 · Updated 4 years ago
YuJungHeo / kbvqa-public
☆40 · Nov 29, 2022 · Updated 3 years ago
CCIIPLab / DPT
Code for the IJCAI 2022 paper "Declaration-based Prompt Tuning for Visual Question Answering"
☆20 · May 10, 2022 · Updated 4 years ago
aurooj / SHG-VQA
Learning Situation Hyper-Graphs for Video Question Answering
☆23 · Feb 16, 2024 · Updated 2 years ago
sjtuytc / Neurips21-ProTo-Program-guided-Transformers-for-Program-guided-Tasks
Official code repo for "ProTo: program-guided Transformers for Program-guided Tasks"
☆21 · Apr 15, 2022 · Updated 4 years ago
PhoebusSi / SAR
Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment"
☆31 · Nov 24, 2021 · Updated 4 years ago
szzexpoi / POEM
Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…
☆10 · Jun 16, 2024 · Updated 2 years ago
prdwb / okvqa-release
☆15 · May 10, 2021 · Updated 5 years ago
codexxxl / GraphVQA
GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering
☆65 · Sep 4, 2021 · Updated 4 years ago
yuleiniu / introd
[NeurIPS 2021] Introspective Distillation for Robust Question Answering
☆13 · Dec 7, 2021 · Updated 4 years ago
Zhiquan-Wen / D-VQA
PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)
☆26 · Oct 13, 2022 · Updated 3 years ago
SpencerWhitehead / novelvqa
☆27 · Oct 7, 2021 · Updated 4 years ago
ThalesGroup / ConceptBERT
Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering
☆31 · Apr 30, 2024 · Updated 2 years ago
rentainhe / TRAR-VQA
[ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"
☆68 · Oct 11, 2021 · Updated 4 years ago
ovguyo / captions-in-VQA
Using image captions with LLM for zero-shot VQA
☆19 · Mar 14, 2024 · Updated 2 years ago
yashkant / sam-textvqa
Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.
☆65 · Sep 15, 2021 · Updated 4 years ago
ZihaoW123 / UniMM
Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"
☆13 · May 12, 2023 · Updated 3 years ago
HKUST-KnowComp / VD-PCR
Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"
☆10 · Nov 1, 2022 · Updated 3 years ago
AndersonStra / Mucko
Implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering
☆10 · Mar 17, 2022 · Updated 4 years ago
jlian2 / mucko
PyTorch implementation of MUCKO (IJCAI 2020)
☆20 · Oct 25, 2020 · Updated 5 years ago
wenhuchen / Meta-Module-Network
Code for WACV 2021 Paper "Meta Module Network for Compositional Visual Reasoning"
☆43 · May 13, 2021 · Updated 5 years ago
CrossmodalGroup / SSL-VQA
Code for our IJCAI2020 paper: Overcoming Language Priors with Self-supervised Learning for Visual Question Answering
☆52 · Aug 21, 2020 · Updated 5 years ago
zaynmi / seada-vqa
A PyTorch implementation of a data augmentation method for visual question answering
☆21 · May 25, 2023 · Updated 3 years ago
MILVLG / prophet
Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".
☆278 · Jun 14, 2025 · Updated last year
microsoft / PICa
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA, AAAI 2022 (Oral)
☆88 · Apr 10, 2022 · Updated 4 years ago
vipulgupta1011 / swapmix
☆20 · Oct 21, 2022 · Updated 3 years ago
ItemZheng / KDDAug
[ECCV2022] Rethinking Data Augmentation for Robust Visual Question Answering
☆13 · Nov 23, 2022 · Updated 3 years ago
gicheonkang / sglkt-visdial
🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"
☆13 · Feb 1, 2023 · Updated 3 years ago
aioz-ai / MICCAI21_MMQ
Multiple Meta-model Quantifying for Medical Visual Question Answering (MICCAI 2021)
☆37 · Apr 21, 2026 · Updated 3 months ago
bowen-upenn / Multi-Agent-VQA
[CVPR 2024 CVinW] Multi-Agent VQA: Exploring Multi-Agent Foundation Models on Zero-Shot Visual Question Answering
☆22 · Sep 21, 2024 · Updated last year
linjieli222 / VQA_ReGAT
Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"
☆187 · Apr 15, 2021 · Updated 5 years ago
CarolineGao / LoRA-Dataset
[NeurIPS2023] LoRA: A Logical Reasoning Augmented Dataset for Visual Question Answering
☆12 · Jan 5, 2024 · Updated 2 years ago
jingchenchen / ReasoningConsistency-VQA
☆13 · Aug 14, 2022 · Updated 3 years ago
gicheonkang / gst-visdial
Official PyTorch Implementation for CVPR'23 Paper, "The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training"
☆20 · Dec 11, 2023 · Updated 2 years ago
MILVLG / mcan-vqa
Deep Modular Co-Attention Networks for Visual Question Answering
☆459 · Dec 16, 2020 · Updated 5 years ago
LouChao98 / VLGAE
Official Implementation for CVPR 2022 paper "Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language …
☆24 · Oct 19, 2022 · Updated 3 years ago