szzexpoi/POEM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/szzexpoi/POEM)

szzexpoi / POEM

Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasoning"

☆10

Alternatives and similar repositories for POEM

Users that are interested in POEM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

val-iisc / RMLVQA
View on GitHub
☆19May 31, 2023Updated 3 years ago
zhangxi1997 / VQACL
View on GitHub
VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)
☆45Mar 28, 2024Updated 2 years ago
showlab / CLVQA
View on GitHub
[AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)
☆42Mar 23, 2024Updated 2 years ago
aditya10 / VLC-BERT
View on GitHub
Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"
☆21May 8, 2023Updated 3 years ago
AlonMendelson / SGVL
View on GitHub
☆17Dec 13, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yuleiniu / introd
View on GitHub
[NeurIPS 2021] Introspective Distillation for Robust Question Answering
☆13Dec 7, 2021Updated 4 years ago
GaryJiajia / OFv2_ICL_VQA
View on GitHub
[CVPR 2024] How to Configure Good In-Context Sequence for Visual Question Answering
☆21May 28, 2025Updated last year
facebookresearch / reliable_vqa
View on GitHub
Implementation for the paper "Reliable Visual Question Answering Abstain Rather Than Answer Incorrectly" (ECCV 2022: https//arxiv.org/abs…
☆41May 19, 2023Updated 3 years ago
Zhiquan-Wen / D-VQA
View on GitHub
PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)
☆26Oct 13, 2022Updated 3 years ago
rabiulcste / vqazero
View on GitHub
visual question answering prompting recipes for large vision-language models
☆29Sep 14, 2024Updated last year
UMass-Embodied-AGI / VisualCoT
View on GitHub
Codebase for AAAI 2024 conference paper Visual Chain-of-Thought Prompting for Knowledge-based Visual Reasoning
☆40Mar 12, 2025Updated last year
ovguyo / captions-in-VQA
View on GitHub
Using image captions with LLM for zero-shot VQA
☆19Mar 14, 2024Updated 2 years ago
microsoft / PICa
View on GitHub
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA, AAAI 2022 (Oral)
☆88Apr 10, 2022Updated 4 years ago
wenhuchen / Meta-Module-Network
View on GitHub
Code for WACV 2021 Paper "Meta Module Network for Compositional Visual Reasoning"
☆43May 13, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
saccharomycetes / visual_crop_zsvqa
View on GitHub
☆12Apr 10, 2024Updated 2 years ago
aioz-ai / CFR_VQA
View on GitHub
Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)
☆48Apr 22, 2026Updated 3 months ago
jingyi-zhang / LiDARCap
View on GitHub
☆16Jun 22, 2024Updated 2 years ago
shenxiang-vqa / LSAT
View on GitHub
Local self-attention in Transformer for visual question answering
☆13Mar 17, 2024Updated 2 years ago
AndersonStra / MuKEA
View on GitHub
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering
☆101Mar 30, 2023Updated 3 years ago
BierOne / Attention-Faithfulness
View on GitHub
[ICML 2022] This is the pytorch implementation of "Rethinking Attention-Model Explainability through Faithfulness Violation Test" (https:…
☆20Jul 21, 2022Updated 4 years ago
tejas-gokhale / vqa_mutant
View on GitHub
☆13Feb 14, 2022Updated 4 years ago
BatsResearch / ex2
View on GitHub
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions
☆17Apr 4, 2024Updated 2 years ago
archiki / RepARe
View on GitHub
☆21Oct 10, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
LouChao98 / VLGAE
View on GitHub
Official Implementation for CVPR 2022 paper "Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language …
☆24Oct 19, 2022Updated 3 years ago
luomancs / retriever_reader_for_okvqa
View on GitHub
☆19Dec 8, 2022Updated 3 years ago
gqa-ood / GQA-OOD
View on GitHub
GQA-OOD is a new dataset and benchmark for the evaluation of VQA models in OOD (out of distribution) settings.
☆33Mar 1, 2021Updated 5 years ago
CR-Gjx / Img2Prompt
View on GitHub
Evaluation codes of "From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language Models".
☆18May 15, 2023Updated 3 years ago
MILVLG / prophet
View on GitHub
Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".
☆278Jun 14, 2025Updated last year
guoyang9 / UnifER
View on GitHub
Official implementation for the MM'22 paper.
☆14Jun 30, 2022Updated 4 years ago
hackerchenzhuo / LaKo
View on GitHub
[Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection
☆24Feb 9, 2024Updated 2 years ago
yashkant / concat-vqa
View on GitHub
Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021
☆19Jul 27, 2021Updated 5 years ago
rentainhe / TRAR-VQA
View on GitHub
[ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"
☆68Oct 11, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
shijx12 / XNM-Net
View on GitHub
Pytorch implementation of "Explainable and Explicit Visual Reasoning over Scene Graphs "
☆94Mar 17, 2019Updated 7 years ago
szzexpoi / rex
View on GitHub
Official Repository for CVPR 2022 paper "REX: Reasoning-aware and Grounded Explanation"
☆22Nov 21, 2023Updated 2 years ago
ceyzaguirre4 / NSM
View on GitHub
Neural State Machine implemented in PyTorch
☆71Oct 10, 2019Updated 6 years ago
ExplainableML / CLEVR-X
View on GitHub
CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations
☆30Oct 27, 2023Updated 2 years ago
SpencerWhitehead / novelvqa
View on GitHub
☆27Oct 7, 2021Updated 4 years ago
mshukor / EvALign-ICL
View on GitHub
[ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …
☆22Mar 1, 2024Updated 2 years ago
yanxinzju / CSS-VQA
View on GitHub
Counterfactual Samples Synthesizing for Robust VQA
☆78Nov 24, 2022Updated 3 years ago