Code for our ACL-2023 paper: "Combo of Thinking and Observing for Outside-Knowledge VQA"
☆12 · Jun 30, 2023 · Updated 2 years ago
Alternatives and similar repositories for Thinking-while-Observing
Users who are interested in Thinking-while-Observing are comparing it to the libraries listed below.
- the code for paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering ☆13 · Aug 22, 2023 · Updated 2 years ago
- Official implementation for the MM'22 paper. ☆14 · Jun 30, 2022 · Updated 3 years ago
- [Paper][IJCKG 2022] LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection ☆25 · Feb 9, 2024 · Updated 2 years ago
- ☆30 · Dec 16, 2022 · Updated 3 years ago
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language" ☆69 · Jul 11, 2022 · Updated 3 years ago
- ☆18 · May 31, 2023 · Updated 2 years ago
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge" ☆21 · May 8, 2023 · Updated 2 years ago
- This is the official repository for Retrieval Augmented Visual Question Answering ☆244 · Dec 19, 2024 · Updated last year
- MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering ☆100 · Mar 30, 2023 · Updated 2 years ago
- Pytorch Implementation of MUCKO(2020 IJCAI) ☆20 · Oct 25, 2020 · Updated 5 years ago
- a multimodal retrieval dataset ☆24 · Jul 8, 2023 · Updated 2 years ago
- The source code of [WWW 2025] MoDiCF ☆12 · Jul 12, 2025 · Updated 7 months ago
- SyMuRBench: Benchmark for symbolic music representations ☆17 · Nov 6, 2025 · Updated 3 months ago
- ☆10 · Nov 29, 2022 · Updated 3 years ago
- ☆12 · Jan 4, 2023 · Updated 3 years ago
- [ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference" ☆11 · Sep 3, 2024 · Updated last year
- ☆12 · Dec 8, 2022 · Updated 3 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog" ☆13 · May 12, 2023 · Updated 2 years ago
- Public code repository to reproduce our MICCAI 2022 paper: "Automatic identification of segmentation errors for radiotherapy using geomet… ☆11 · Dec 8, 2022 · Updated 3 years ago
- Study notes ☆11 · Oct 30, 2024 · Updated last year
- EHR datasets preprocessing scripts ☆11 · Jan 31, 2024 · Updated 2 years ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization" ☆11 · May 16, 2023 · Updated 2 years ago
- The implementation of our IEEE S&P 2024 paper "Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples". ☆11 · Jun 28, 2024 · Updated last year
- Scratchpad/Chain-of-Thought Prompts ☆12 · Jun 6, 2022 · Updated 3 years ago
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation ☆41 · Oct 19, 2022 · Updated 3 years ago
- Official repository for the ICCV 2021 (Oral) paper "(Just) A Spoonful of Refinements Helps the Registration Error Go Down" ☆11 · Dec 21, 2021 · Updated 4 years ago
- Surface geometry plugin for Rhinoceros 3D ☆10 · Aug 21, 2018 · Updated 7 years ago
- The backup repository for FairytaleQA dataset and paper "Fantastic Questions and Where to Find Them: FairytaleQA – An Authentic Dataset f… ☆10 · May 30, 2023 · Updated 2 years ago
- This contains example code from "A Primer on Topological Data Analysis to Support Image Analysis Tasks in Environmental Science" for runn… ☆10 · Oct 10, 2022 · Updated 3 years ago
- ACL 2023 *oral* paper "MGR: Multi-generator based Rationalization" ☆10 · Nov 21, 2024 · Updated last year
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization ☆13 · Mar 20, 2025 · Updated 11 months ago
- The official repository for the paper entitled "Time Travel in LLMs: Tracing Data Contamination in Large Language Models." ☆12 · Jun 11, 2024 · Updated last year
- Automaton & Cognition ☆16 · Apr 14, 2024 · Updated last year
- ☆10 · Jan 11, 2023 · Updated 3 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem". ☆12 · Oct 25, 2021 · Updated 4 years ago
- ☆11 · Sep 10, 2023 · Updated 2 years ago
- The codes and datasets about our ACL 2024 Main Conference paper titled "Cognitive Visual-Language Mapper: Advancing Multimodal Comprehens… ☆17 · Jan 24, 2025 · Updated last year
- FOLPSν is a code for efficiently evaluating the redshift space power spectrum in the presence of massive neutrinos ☆11 · Jun 22, 2025 · Updated 8 months ago
- ☆13 · Aug 17, 2022 · Updated 3 years ago