val-iisc/RMLVQA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/val-iisc/RMLVQA)

val-iisc / RMLVQA

☆19

Alternatives and similar repositories for RMLVQA

Users that are interested in RMLVQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

szzexpoi / POEM
View on GitHub
Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…
☆10Jun 16, 2024Updated 2 years ago
jingjing12110 / MixPHM
View on GitHub
[CVPR 2023] Pytorch Code of MixPHM: Redundancy-Aware Parameter-Efficient Tuning for Low-Resource Visual Question Answering
☆17Jul 11, 2023Updated 3 years ago
facebookresearch / selective-vqa_ood
View on GitHub
Implementation for the CVPR 2023 paper "Improving Selective Visual Question Answering by Learning from Your Peers" (https://arxiv.org/abs…
☆26Jul 20, 2023Updated 3 years ago
chojw / genb
View on GitHub
Generative Bias for Robust Visual Question Answering ( CVPR 2023 )
☆28Jul 4, 2023Updated 3 years ago
shenxiang-vqa / LSAT
View on GitHub
Local self-attention in Transformer for visual question answering
☆13Mar 17, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yuleiniu / introd
View on GitHub
[NeurIPS 2021] Introspective Distillation for Robust Question Answering
☆13Dec 7, 2021Updated 4 years ago
ovguyo / captions-in-VQA
View on GitHub
Using image captions with LLM for zero-shot VQA
☆19Mar 14, 2024Updated 2 years ago
PhoebusSi / MMBS
View on GitHub
Code for our EMNLP-2022 paper: "Towards Robust Visual Question Answering: Making the Most of Biased Samples via Contrastive Learning"
☆16Feb 22, 2023Updated 3 years ago
GeraldHan / GGE
View on GitHub
Code for Greedy Gradient Ensemble for Visual Question Answering （ICCV 2021, Oral）
☆27Mar 28, 2022Updated 4 years ago
gqa-ood / GQA-OOD
View on GitHub
GQA-OOD is a new dataset and benchmark for the evaluation of VQA models in OOD (out of distribution) settings.
☆33Mar 1, 2021Updated 5 years ago
PhoebusSi / Thinking-while-Observing
View on GitHub
Code for our ACL-2023 paper: "Combo of Thinking and Observing for Outside-Knowledge VQA"
☆12Jun 30, 2023Updated 3 years ago
Zhiquan-Wen / D-VQA
View on GitHub
PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)
☆26Oct 13, 2022Updated 3 years ago
aioz-ai / CFR_VQA
View on GitHub
Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)
☆49Apr 22, 2026Updated 3 months ago
prdwb / okvqa-release
View on GitHub
☆15May 10, 2021Updated 5 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
zhangxi1997 / VQACL
View on GitHub
VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)
☆45Mar 28, 2024Updated 2 years ago
ItemZheng / KDDAug
View on GitHub
[ECCV2022] Rethinking Data Augmentation for Robust Visual Question Answering
☆13Nov 23, 2022Updated 3 years ago
codezakh / SelTDA
View on GitHub
[CVPR 23] Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
☆17May 14, 2024Updated 2 years ago
alexandrosXe / A-Simple-Baseline-For-Knowledge-Based-VQA
View on GitHub
Repo for the EMNLP 2023 paper "A Simple Knowledge-Based Visual Question Answering"
☆25Dec 14, 2023Updated 2 years ago
rentainhe / TRAR-VQA
View on GitHub
[ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"
☆68Oct 11, 2021Updated 4 years ago
wenhuchen / Meta-Module-Network
View on GitHub
Code for WACV 2021 Paper "Meta Module Network for Compositional Visual Reasoning"
☆43May 13, 2021Updated 5 years ago
CCIIPLab / DPT
View on GitHub
The code of IJCAI2022 paper, Declaration-based Prompt Tuning for Visual Question Answering
☆20May 10, 2022Updated 4 years ago
lmb-freiburg / ovqa
View on GitHub
☆15Jun 17, 2026Updated last month
facebookresearch / reliable_vqa
View on GitHub
Implementation for the paper "Reliable Visual Question Answering Abstain Rather Than Answer Incorrectly" (ECCV 2022: https//arxiv.org/abs…
☆41May 19, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
AndersonStra / MuKEA
View on GitHub
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering
☆101Mar 30, 2023Updated 3 years ago
yuleiniu / cfvqa
View on GitHub
[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias
☆136Dec 15, 2021Updated 4 years ago
rabiulcste / vqazero
View on GitHub
visual question answering prompting recipes for large vision-language models
☆29Sep 14, 2024Updated last year
xiaojino / RUArt
View on GitHub
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering
☆10Nov 27, 2022Updated 3 years ago
ZihaoW123 / UniMM
View on GitHub
Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"
☆13May 12, 2023Updated 3 years ago
CGCL-codes / Gen-AF
View on GitHub
The implementation of our IEEE S&P 2024 paper "Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples".
☆11Jun 28, 2024Updated 2 years ago
TIAN-viola / DynRT
View on GitHub
Official implementation of Dynamic Routing Transformer Network for Multimodal Sarcasm Detection (ACL'23)
☆35Jul 9, 2023Updated 3 years ago
microsoft / PICa
View on GitHub
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA, AAAI 2022 (Oral)
☆88Apr 10, 2022Updated 4 years ago
xvolcano02 / DMSD-CL
View on GitHub
[AAAI2024] Debiasing Multimodal Sarcasm Detection with Contrastive Learning
☆17Jan 5, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
Gary-code / KECVQG
View on GitHub
[ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"
☆10Sep 3, 2024Updated last year
alirezasalemi7 / DEDR-MM-FiD
View on GitHub
the code for paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering
☆14Aug 22, 2023Updated 2 years ago
ThalesGroup / ConceptBERT
View on GitHub
Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering
☆31Apr 30, 2024Updated 2 years ago
showlab / CLVQA
View on GitHub
[AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)
☆42Mar 23, 2024Updated 2 years ago
CR-Gjx / Img2Prompt
View on GitHub
Evaluation codes of "From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language Models".
☆17May 15, 2023Updated 3 years ago
marsggbo / EAGAN
View on GitHub
(ECCV2022) EAGAN: EAGAN: Efficient Two-stage Evolutionary Architecture Search for GANs
☆12Sep 15, 2022Updated 3 years ago
SRI-CSL / TrinityMultimodalTrojAI
View on GitHub
☆35Jun 27, 2022Updated 4 years ago