Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"
☆21May 8, 2023Updated 3 years ago
Alternatives and similar repositories for VLC-BERT
Users that are interested in VLC-BERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection☆24Feb 9, 2024Updated 2 years ago
- virtual node analysis on ogb benchmark dataset☆14Mar 9, 2023Updated 3 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 3 years ago
- Code for our ACL-2023 paper: "Combo of Thinking and Observing for Outside-Knowledge VQA"☆12Jun 30, 2023Updated 2 years ago
- ☆30Dec 16, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering☆101Mar 30, 2023Updated 3 years ago
- Rich Visual Knowledge-based AugmentationNetwork for Visual Question Answering☆10Dec 6, 2019Updated 6 years ago
- [ECCV2022] Rethinking Data Augmentation for Robust Visual Question Answering☆13Nov 23, 2022Updated 3 years ago
- Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Fee…☆27Nov 11, 2023Updated 2 years ago
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆42Mar 23, 2024Updated 2 years ago
- ☆19May 31, 2023Updated 2 years ago
- Bilateral Cross-Modality Graph Matching Attention for Feature Fusion in Visual Question Answering☆11Feb 16, 2023Updated 3 years ago
- Official implementation for the MM'22 paper.☆14Jun 30, 2022Updated 3 years ago
- [CVPR 2024] How to Configure Good In-Context Sequence for Visual Question Answering☆21May 28, 2025Updated 11 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆16Dec 25, 2021Updated 4 years ago
- Weakly Supervised Grounding for VQA in Vision-Language Transformers☆16May 6, 2023Updated 3 years ago
- [ICLR 2025] No Preference Left Behind: Group Distributional Preference Optimization☆16Apr 21, 2025Updated last year
- The implementation of our IEEE S&P 2024 paper "Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples".☆11Jun 28, 2024Updated last year
- [ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"☆10Sep 3, 2024Updated last year
- Source code of our TCSVT 2020 paper "Multi-level Knowledge Injecting for Visual Commonsense Reasoning"☆11Sep 18, 2024Updated last year
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆16Feb 22, 2025Updated last year
- Official Repository for CVPR 2022 paper "REX: Reasoning-aware and Grounded Explanation"☆22Nov 21, 2023Updated 2 years ago
- [ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"☆68Oct 11, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆13Aug 17, 2022Updated 3 years ago
- ☆34Jun 27, 2022Updated 3 years ago
- Authors's code for "Variational Causal Inference Network for Explanatory Visual Question Answering" and "Integrating Neural-Symbolic Reas…☆12Apr 13, 2026Updated last month
- [Findings of ACL 2023] Bridge the Gap Between CV and NLP! A Optimization-based Textual Adversarial Attack Framework.☆14Aug 27, 2023Updated 2 years ago
- YesBut - Multimodal Satire Comprehension Dataset☆19Oct 23, 2024Updated last year
- DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog☆25Mar 8, 2022Updated 4 years ago
- Deep neural network architecture for representing robot experiences in an episodic-like memory which facilitates encoding, recalling, and…☆15Sep 12, 2018Updated 7 years ago
- [CVPR 2023] Pytorch Code of MixPHM: Redundancy-Aware Parameter-Efficient Tuning for Low-Resource Visual Question Answering☆17Jul 11, 2023Updated 2 years ago
- Question-Directed Graph Attention Network for Numerical Reasoning over Text☆10Aug 14, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for the paper "Jailbreak Large Vision-Language Models Through Multi-Modal Linkage"☆33Dec 6, 2024Updated last year
- Universal Adversarial Perturbations for Vision-Language Pre-trained Models☆24Aug 8, 2025Updated 9 months ago
- [ACL 2024] FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Model☆17Apr 28, 2025Updated last year
- ☆15May 18, 2026Updated last week
- Implementation for the CVPR 2023 paper "Improving Selective Visual Question Answering by Learning from Your Peers" (https://arxiv.org/abs…☆25Jul 20, 2023Updated 2 years ago
- ☆10Apr 20, 2018Updated 8 years ago
- [CVPR2024 Highlight] Strong Transferable Adversarial Attacks via Ensembled Asymptotically Normal Distribution Learning☆19Jun 14, 2024Updated last year