wanng-ide / VQA_to_multimodal_surveyView external linksLinks
Update 2020
☆75Mar 21, 2022Updated 3 years ago
Alternatives and similar repositories for VQA_to_multimodal_survey
Users that are interested in VQA_to_multimodal_survey are comparing it to the libraries listed below
Sorting:
- Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)☆23Jun 26, 2020Updated 5 years ago
- Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment"☆31Nov 24, 2021Updated 4 years ago
- [Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph☆72Feb 9, 2024Updated 2 years ago
- Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering☆18Oct 30, 2019Updated 6 years ago
- CMIVQA☆18Jun 3, 2024Updated last year
- A collections of papers about VQA-CP datasets and their results☆41Mar 18, 2022Updated 3 years ago
- ☆22Aug 10, 2020Updated 5 years ago
- Code for NeurIPS 2019 paper ``Self-Critical Reasoning for Robust Visual Question Answering''☆41Sep 9, 2019Updated 6 years ago
- MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering☆100Mar 30, 2023Updated 2 years ago
- A pytorch implemetation of data augmentation method for visual question answering☆21May 25, 2023Updated 2 years ago
- [ICML2025] LoRA fine-tune directly on the quantized models.☆39Nov 25, 2024Updated last year
- DeVLBert: Learning Deconfounded Visio-Linguistic Representations☆27Nov 27, 2022Updated 3 years ago
- Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering☆27Apr 15, 2021Updated 4 years ago
- PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)☆27Oct 13, 2022Updated 3 years ago
- BLOCK (AAAI 2019), with a multimodal fusion library for deep learning models☆356Dec 4, 2019Updated 6 years ago
- [CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias☆130Dec 15, 2021Updated 4 years ago
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆41Mar 23, 2024Updated last year
- CVPR2025: Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning☆38Mar 21, 2025Updated 10 months ago
- ☆77Nov 22, 2022Updated 3 years ago
- ☆38Jan 20, 2023Updated 3 years ago
- Repository of our paper 'Refer-it-in-RGBD' in CVPR 2021☆43May 24, 2024Updated last year
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"☆35Dec 5, 2022Updated 3 years ago
- [CVPR2023] The official repository for paper "Learning Partial Correlation based Deep Visual Representation for Image Classification" To …☆10Nov 21, 2023Updated 2 years ago
- Implementing CNN in PyTorch with Custom Dataset and Transfer Learning☆11Aug 24, 2020Updated 5 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- GAIIC2024无人机视角下的双光目标检测 - Rank6 解决方案☆11Jun 17, 2024Updated last year
- Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"☆10Nov 1, 2022Updated 3 years ago
- code for LSN☆10Oct 28, 2024Updated last year
- Promptopia is an open-source AI prompting tool for modern world to discover, create, and share creative prompts☆12May 27, 2023Updated 2 years ago
- [NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models☆158Dec 9, 2024Updated last year
- ☆49Mar 21, 2022Updated 3 years ago
- C++教学用的slides & 作业☆28Dec 18, 2014Updated 11 years ago
- ☆10May 31, 2021Updated 4 years ago
- implement gat with batch☆10Nov 28, 2020Updated 5 years ago
- 此项目创建的初衷是为了帮助人工智能、自然语言处理和大语言模型相关背景的同学找工作使用,欢迎加入项目的建设和维护☆16Mar 30, 2025Updated 10 months ago
- ☆12Aug 14, 2019Updated 6 years ago
- WeChat Moments Demo Project☆11Mar 20, 2023Updated 2 years ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering☆10Mar 17, 2022Updated 3 years ago