Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering
☆31Apr 30, 2024Updated last year
Alternatives and similar repositories for ConceptBERT
Users that are interested in ConceptBERT are comparing it to the libraries listed below
Sorting:
- ☆30Dec 16, 2022Updated 3 years ago
- Pytorch Implementation of MUCKO(2020 IJCAI)☆20Oct 25, 2020Updated 5 years ago
- ☆10Jul 23, 2021Updated 4 years ago
- [Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph☆72Feb 9, 2024Updated 2 years ago
- the code for paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering☆13Aug 22, 2023Updated 2 years ago
- ☆14May 10, 2021Updated 4 years ago
- Using image captions with LLM for zero-shot VQA☆18Mar 14, 2024Updated last year
- Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"☆10Nov 1, 2022Updated 3 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- Compact Trilinear Interaction for Visual Question Answering (ICCV 2019)☆38Nov 22, 2022Updated 3 years ago
- ☆11Apr 10, 2024Updated last year
- [Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection☆25Feb 9, 2024Updated 2 years ago
- [ECCV2022] Rethinking Data Augmentation for Robust Visual Question Answering☆13Nov 23, 2022Updated 3 years ago
- ✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"☆45Mar 19, 2023Updated 2 years ago
- PyTorch Implementation of Multi-View Attention Networks for Visual Dialog☆43Mar 24, 2023Updated 2 years ago
- Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)☆48Nov 3, 2022Updated 3 years ago
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration☆56Jun 13, 2023Updated 2 years ago
- ☆13Aug 14, 2022Updated 3 years ago
- 🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"☆13Feb 1, 2023Updated 3 years ago
- Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379☆97Mar 31, 2020Updated 5 years ago
- Multiple Meta-model Quantifying for Medical Visual Question Answering (MICCAI 2021)☆37Oct 12, 2022Updated 3 years ago
- GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering☆65Sep 4, 2021Updated 4 years ago
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"☆69Jul 11, 2022Updated 3 years ago
- ☆18May 31, 2023Updated 2 years ago
- Medical Knowledge-Based Network For Patient-oriented Visual Question Answering☆18Feb 25, 2023Updated 3 years ago
- Official PyTorch Implementation for CVPR'23 Paper, "The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training"☆20Dec 11, 2023Updated 2 years ago
- [NeurIPS 2022] Official code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering☆105Apr 6, 2025Updated 11 months ago
- An implementation that downstreams pre-trained V+L models to VQA tasks. Now support: VisualBERT, LXMERT, and UNITER☆166Dec 11, 2022Updated 3 years ago
- PyTorch implementation of paper: "Self-critical Sequence Training for Image Captioning"☆25Apr 8, 2023Updated 2 years ago
- ☆77Nov 22, 2022Updated 3 years ago
- ☆15Aug 13, 2020Updated 5 years ago
- An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA, AAAI 2022 (Oral)☆87Apr 10, 2022Updated 3 years ago
- DSTC10 Track1 - MOD: Internet Meme Incorporated Open-domain Dialog☆50Feb 16, 2023Updated 3 years ago
- Official Repository for CVPR 2022 paper "REX: Reasoning-aware and Grounded Explanation"☆22Nov 21, 2023Updated 2 years ago
- [ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"☆69Oct 11, 2021Updated 4 years ago
- A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Common…☆673Jul 6, 2023Updated 2 years ago
- Visual Coreference Resolution in Visual Dialog using Neural Module Networks☆57Oct 12, 2021Updated 4 years ago
- ☆27Oct 7, 2021Updated 4 years ago
- Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome☆23Aug 22, 2019Updated 6 years ago