Multimodal Question Answering in the Medical Domain: A summary of Existing Datasets and Systems
☆319Oct 17, 2023Updated 2 years ago
Alternatives and similar repositories for Existing-Medical-QA-Datasets
Users that are interested in Existing-Medical-QA-Datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Medical Question Answering Dataset of 47,457 QA pairs created from 12 NIH websites☆455Oct 17, 2023Updated 2 years ago
- Dataset for medical question summarization introduced in the ACL 2019 paper "On the Summarization of Consumer Health Questions" (A. Ben A…☆33May 13, 2026Updated last week
- Visual Question Answering in the Medical Domain VQA-Med 2019☆94May 13, 2026Updated last week
- PMC-VQA is a large-scale medical visual question-answering dataset, which contains 227k VQA pairs of 149k images that cover various modal…☆233Dec 6, 2024Updated last year
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations☆89Dec 18, 2025Updated 5 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Corpus of Online Medical EnTities: the cometA corpus☆51Mar 6, 2025Updated last year
- ☆71Feb 3, 2025Updated last year
- CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering☆23Feb 26, 2021Updated 5 years ago
- Medical Question-Answering datasets prepared for the TREC 2017 LiveQA challenge (Medical Task)☆56May 13, 2026Updated last week
- ☆19Oct 13, 2022Updated 3 years ago
- A new collection of medical VQA dataset based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electr…☆99Feb 6, 2026Updated 3 months ago
- Code for "A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models"☆17Jul 20, 2025Updated 10 months ago
- EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images (NeurIPS 2023 D&B)☆95Feb 6, 2026Updated 3 months ago
- Code implementation of RP3D-Diag☆79Aug 29, 2025Updated 8 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medic…☆286Oct 18, 2023Updated 2 years ago
- Dataset of medical images, captions, subfigure-subcaption annotations, and inline textual references☆173Feb 19, 2026Updated 3 months ago
- [Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medi…☆2,012Sep 27, 2025Updated 7 months ago
- A large-scale (194k), Multiple-Choice Question Answering (MCQA) dataset designed to address realworld medical entrance exam questions.☆276Nov 28, 2022Updated 3 years ago
- [Nature Communications] The official codes for "Towards Building Multilingual Language Model for Medicine"☆281May 9, 2025Updated last year
- [npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆78May 5, 2025Updated last year
- VQA-Med 2021☆22May 13, 2026Updated last week
- The medical question entailment data introduced in the AMIA 2016 Paper (Recognizing Question Entailment for Medical Question Answering)☆14May 13, 2026Updated last week
- Code for the emrQA question answering dataset☆153Feb 9, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A collection of resources on applications of multi-modal learning in medical imaging.☆956Feb 8, 2026Updated 3 months ago
- Tools for curating biomedical training data for large-scale language modeling☆498Dec 9, 2024Updated last year
- Biomedical Question Answering Datasets.☆129Apr 30, 2025Updated last year
- MedAlign is a clinician-generated dataset for instruction following with electronic medical records.☆99May 17, 2025Updated last year
- Code and data for MedQA☆378Dec 1, 2022Updated 3 years ago
- Medical Visual Question Answering via Conditional Reasoning [ACM MM 2020]☆64Aug 20, 2021Updated 4 years ago
- Evaluation Pipeline for medical tasks.☆12Apr 8, 2026Updated last month
- CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]☆79Jun 28, 2022Updated 3 years ago
- MedDistant19: Towards an Accurate Benchmark for Broad-Coverage Biomedical Relation Extraction (COLING 2022)☆18Oct 13, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [NeurIPS 2025] ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real World☆135Aug 18, 2024Updated last year
- BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).☆592Mar 25, 2023Updated 3 years ago
- The paper list of the review on LLMs in medicine - "Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assis…☆267Dec 23, 2023Updated 2 years ago
- LLM finetuned for medical question answering☆559Sep 7, 2023Updated 2 years ago
- Localized questions for VQA☆11May 6, 2025Updated last year
- PubMedQA: A Dataset for Biomedical Research Question Answering☆422Apr 18, 2023Updated 3 years ago
- ☆13Oct 20, 2022Updated 3 years ago