Multimodal Question Answering in the Medical Domain: A summary of Existing Datasets and Systems
☆319Oct 17, 2023Updated 2 years ago
Alternatives and similar repositories for Existing-Medical-QA-Datasets
Users that are interested in Existing-Medical-QA-Datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Medical Question Answering Dataset of 47,457 QA pairs created from 12 NIH websites☆454Oct 17, 2023Updated 2 years ago
- ☆24May 13, 2026Updated last month
- Dataset for medical question summarization introduced in the ACL 2019 paper "On the Summarization of Consumer Health Questions" (A. Ben A…☆33May 13, 2026Updated last month
- Radiology Objects in COntext (ROCO): A Multimodal Image Dataset☆246Apr 5, 2022Updated 4 years ago
- Visual Question Answering in the Medical Domain VQA-Med 2019☆95May 13, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- PMC-VQA is a large-scale medical visual question-answering dataset, which contains 227k VQA pairs of 149k images that cover various modal…☆233Dec 6, 2024Updated last year
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations☆92Dec 18, 2025Updated 5 months ago
- Corpus of Online Medical EnTities: the cometA corpus☆51Mar 6, 2025Updated last year
- ☆73Feb 3, 2025Updated last year
- Medical Question-Answering datasets prepared for the TREC 2017 LiveQA challenge (Medical Task)☆56May 13, 2026Updated last month
- ☆19Oct 13, 2022Updated 3 years ago
- A new collection of medical VQA dataset based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electr…☆100Feb 6, 2026Updated 4 months ago
- Code for "A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models"☆17Jul 20, 2025Updated 10 months ago
- Code implementation of RP3D-Diag☆79Aug 29, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medic…☆286Oct 18, 2023Updated 2 years ago
- Dataset of medical images, captions, subfigure-subcaption annotations, and inline textual references☆175Feb 19, 2026Updated 3 months ago
- [Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medi…☆2,020Sep 27, 2025Updated 8 months ago
- A large-scale (194k), Multiple-Choice Question Answering (MCQA) dataset designed to address realworld medical entrance exam questions.☆276Nov 28, 2022Updated 3 years ago
- [Nature Communications] The official codes for "Towards Building Multilingual Language Model for Medicine"☆281May 9, 2025Updated last year
- [npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆78May 5, 2025Updated last year
- VQA-Med 2021☆23May 13, 2026Updated last month
- Challenge on Textual Inference and Question Entailment in the Medical Domain https://sites.google.com/view/mediqa2019☆52May 13, 2026Updated last month
- The medical question entailment data introduced in the AMIA 2016 Paper (Recognizing Question Entailment for Medical Question Answering)☆14May 13, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for the emrQA question answering dataset☆153Feb 9, 2022Updated 4 years ago
- A collection of resources on applications of multi-modal learning in medical imaging.☆963Jun 4, 2026Updated last week
- Tools for curating biomedical training data for large-scale language modeling☆500Dec 9, 2024Updated last year
- Biomedical Question Answering Datasets.☆129Apr 30, 2025Updated last year
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine☆96Sep 26, 2024Updated last year
- MedAlign is a clinician-generated dataset for instruction following with electronic medical records.☆100May 17, 2025Updated last year
- Code and data for MedQA☆384Dec 1, 2022Updated 3 years ago
- Medical Visual Question Answering via Conditional Reasoning [ACM MM 2020]☆64Aug 20, 2021Updated 4 years ago
- Evaluation Pipeline for medical tasks.☆12Apr 8, 2026Updated 2 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]☆80Jun 28, 2022Updated 3 years ago
- MedDistant19: Towards an Accurate Benchmark for Broad-Coverage Biomedical Relation Extraction (COLING 2022)☆18Oct 13, 2022Updated 3 years ago
- The gold standard corpus for medication question answering introduced in the MedInfo 2019 paper (Bridging the Gap between Consumers’ Medi…☆21Dec 5, 2025Updated 6 months ago
- [NeurIPS 2025] ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real World☆138Aug 18, 2024Updated last year
- The paper list of the review on LLMs in medicine - "Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assis…☆269Dec 23, 2023Updated 2 years ago
- BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).☆595Mar 25, 2023Updated 3 years ago
- LLM finetuned for medical question answering☆559Sep 7, 2023Updated 2 years ago