abachaa/Existing-Medical-QA-Datasets

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/abachaa/Existing-Medical-QA-Datasets)

abachaa / Existing-Medical-QA-Datasets

Multimodal Question Answering in the Medical Domain: A summary of Existing Datasets and Systems

☆317

Alternatives and similar repositories for Existing-Medical-QA-Datasets

Users that are interested in Existing-Medical-QA-Datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

abachaa / MedQuAD
View on GitHub
Medical Question Answering Dataset of 47,457 QA pairs created from 12 NIH websites
☆458Oct 17, 2023Updated 2 years ago
abachaa / MEDIQA2021
View on GitHub
☆24May 13, 2026Updated 2 months ago
abachaa / MeQSum
View on GitHub
Dataset for medical question summarization introduced in the ACL 2019 paper "On the Summarization of Consumer Health Questions" (A. Ben A…
☆33May 13, 2026Updated 2 months ago
razorx89 / roco-dataset
View on GitHub
Radiology Objects in COntext (ROCO): A Multimodal Image Dataset
☆249Apr 5, 2022Updated 4 years ago
abachaa / VQA-Med-2019
View on GitHub
Visual Question Answering in the Medical Domain VQA-Med 2019
☆95May 13, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
xiaoman-zhang / PMC-VQA
View on GitHub
PMC-VQA is a large-scale medical visual question-answering dataset, which contains 227k VQA pairs of 149k images that cover various modal…
☆236Dec 6, 2024Updated last year
cambridgeltl / cometa
View on GitHub
Corpus of Online Medical EnTities: the cometA corpus
☆52Mar 6, 2025Updated last year
sunlab-osu / CliniQG4QA
View on GitHub
CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering
☆23Feb 26, 2021Updated 5 years ago
Holipori / MIMIC-Diff-VQA
View on GitHub
☆73Feb 3, 2025Updated last year
ncbi-nlp / MedCalc-Bench
View on GitHub
[NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations
☆93Dec 18, 2025Updated 7 months ago
abachaa / LiveQA_MedicalTask_TREC2017
View on GitHub
Medical Question-Answering datasets prepared for the TREC 2017 LiveQA challenge (Medical Task)
☆56May 13, 2026Updated 2 months ago
elehman16 / discq
View on GitHub
☆19Oct 13, 2022Updated 3 years ago
baeseongsu / mimic-cxr-vqa
View on GitHub
A new collection of medical VQA dataset based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electr…
☆100Feb 6, 2026Updated 5 months ago
baeseongsu / ehrxqa
View on GitHub
EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images (NeurIPS 2023 D&B)
☆98Feb 6, 2026Updated 5 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
qiaoyu-zheng / RP3D-Diag
View on GitHub
Code implementation of RP3D-Diag
☆79Aug 29, 2025Updated 10 months ago
stefanhgm / patient_summaries_with_llms
View on GitHub
Code for "A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models"
☆17Jul 20, 2025Updated last year
McGill-NLP / medal
View on GitHub
Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medic…
☆285Oct 18, 2023Updated 2 years ago
allenai / medicat
View on GitHub
Dataset of medical images, captions, subfigure-subcaption annotations, and inline textual references
☆175Feb 19, 2026Updated 5 months ago
abachaa / MEDIQA2019
View on GitHub
Challenge on Textual Inference and Question Entailment in the Medical Domain https://sites.google.com/view/mediqa2019
☆52May 13, 2026Updated 2 months ago
AI-in-Health / MedLLMsPracticalGuide
View on GitHub
[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medi…
☆2,034Jul 10, 2026Updated 2 weeks ago
MAGIC-AI4Med / MMedLM
View on GitHub
[Nature Communications] The official codes for "Towards Building Multilingual Language Model for Medicine"
☆284May 9, 2025Updated last year
MAGIC-AI4Med / MedS-Ins
View on GitHub
[npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"
☆79May 5, 2025Updated last year
abachaa / RQE_Data_AMIA2016
View on GitHub
The medical question entailment data introduced in the AMIA 2016 Paper (Recognizing Question Entailment for Medical Question Answering)
☆14May 13, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
panushri25 / emrQA
View on GitHub
Code for the emrQA question answering dataset
☆153Feb 9, 2022Updated 4 years ago
bigscience-workshop / biomedical
View on GitHub
Tools for curating biomedical training data for large-scale language modeling
☆506Dec 9, 2024Updated last year
drqiaojin / biomedical-qa-datasets
View on GitHub
Biomedical Question Answering Datasets.
☆131Apr 30, 2025Updated last year
richard-peng-xia / awesome-multimodal-in-medical-imaging
View on GitHub
A collection of resources on applications of multi-modal learning in medical imaging.
☆972Updated this week
TsinghuaC3I / UltraMedical
View on GitHub
[NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine
☆97Sep 26, 2024Updated last year
som-shahlab / medalign
View on GitHub
MedAlign is a clinician-generated dataset for instruction following with electronic medical records.
☆102May 17, 2025Updated last year
jind11 / MedQA
View on GitHub
Code and data for MedQA
☆388Dec 1, 2022Updated 3 years ago
Awenbocc / med-vqa
View on GitHub
Medical Visual Question Answering via Conditional Reasoning [ACM MM 2020]
☆64Aug 20, 2021Updated 4 years ago
gmpoli / electramed
View on GitHub
☆13Oct 20, 2022Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
GanjinZero / CODER
View on GitHub
CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]
☆80Jun 28, 2022Updated 4 years ago
abachaa / Medication_QA_MedInfo2019
View on GitHub
The gold standard corpus for medication question answering introduced in the MedInfo 2019 paper (Bridging the Gap between Consumers’ Medi…
☆21Dec 5, 2025Updated 7 months ago
suamin / MedDistant19
View on GitHub
MedDistant19: Towards an Accurate Benchmark for Broad-Coverage Biomedical Relation Extraction (COLING 2022)
☆19Oct 13, 2022Updated 3 years ago
ncbi-nlp / bluebert
View on GitHub
BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).
☆597Mar 25, 2023Updated 3 years ago
pubmedqa / pubmedqa
View on GitHub
PubMedQA: A Dataset for Biomedical Research Question Answering
☆433Apr 18, 2023Updated 3 years ago
Demon702 / WorldValuesBench
View on GitHub
☆21Mar 17, 2025Updated last year
kbressem / medAlpaca
View on GitHub
LLM finetuned for medical question answering
☆563Sep 7, 2023Updated 2 years ago