drqiaojin/biomedical-qa-datasets

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/drqiaojin/biomedical-qa-datasets)

drqiaojin / biomedical-qa-datasets

Biomedical Question Answering Datasets.

☆131

Alternatives and similar repositories for biomedical-qa-datasets

Users that are interested in biomedical-qa-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ratschlab / mmugl
View on GitHub
Code repository for MMUGL: Multi-modal Graph Learning over UMLS Knowledge Graphs
☆11Dec 7, 2023Updated 2 years ago
ncbi-nlp / MedCalc-Bench
View on GitHub
[NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations
☆93Dec 18, 2025Updated 7 months ago
abachaa / MedQuAD
View on GitHub
Medical Question Answering Dataset of 47,457 QA pairs created from 12 NIH websites
☆458Oct 17, 2023Updated 2 years ago
ncbi-nlp / cell-o1
View on GitHub
Code and data for Cell-o1.
☆28Updated this week
ritaranx / BMRetriever
View on GitHub
[EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".
☆26Sep 19, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
anazhaw / Bio-SODA
View on GitHub
A Question Answering System for Domain Knowledge Graphs
☆11Feb 24, 2022Updated 4 years ago
som-shahlab / med-nota
View on GitHub
☆15Jun 11, 2025Updated last year
abachaa / Existing-Medical-QA-Datasets
View on GitHub
Multimodal Question Answering in the Medical Domain: A summary of Existing Datasets and Systems
☆317Oct 17, 2023Updated 2 years ago
zhao-zy15 / PMC-Patients
View on GitHub
PMC-Patients
☆113Jun 7, 2024Updated 2 years ago
wizardlancet / diagnosis_zero
View on GitHub
diagnosis_zero, R1 Zero reproduce on disease diagnosis
☆32Jul 24, 2025Updated 11 months ago
GanjinZero / math401-llm
View on GitHub
Source codes and datasets for How well do Large Language Models perform in Arithmetic tasks?
☆57Apr 17, 2023Updated 3 years ago
drqiaojin / AttnMeSH
View on GitHub
Code for AttentionMeSH
☆17Oct 5, 2018Updated 7 years ago
LHNCBC / SemRep
View on GitHub
☆72May 31, 2022Updated 4 years ago
ncbi / MedCPT
View on GitHub
Code for MedCPT, a model for zero-shot biomedical information retrieval.
☆269Mar 24, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
pmc-patients / pmc-patients
View on GitHub
PMC-Patients: A Large-scale Dataset of Patient Summaries and Relations for Benchmarking Retrieval-based Clinical Decision Support Systems…
☆82Dec 20, 2023Updated 2 years ago
GanjinZero / CODER
View on GitHub
CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]
☆80Jun 28, 2022Updated 4 years ago
bvanaken / clinical-outcome-prediction
View on GitHub
Code for the EACL 2021 Paper: Clinical Outcome Prediction from Admission Notes using Self-Supervised Knowledge Integration
☆99Jul 31, 2024Updated last year
gzxiong / MedRAG
View on GitHub
Code for the MedRAG toolkit
☆579May 8, 2025Updated last year
pubmedqa / pubmedqa
View on GitHub
PubMedQA: A Dataset for Biomedical Research Question Answering
☆433Apr 18, 2023Updated 3 years ago
LHNCBC / metamaplite
View on GitHub
A near real-time named-entity recognizer
☆66May 20, 2026Updated 2 months ago
chanzuckerberg / MedMentions
View on GitHub
A corpus of Biomedical papers annotated with mentions of UMLS entities.
☆346Nov 9, 2021Updated 4 years ago
vlievin / medical-reasoning
View on GitHub
Medical reasoning using large language models
☆94Jan 9, 2024Updated 2 years ago
drqiaojin / bioelmo
View on GitHub
BioELMo is a biomedical version of embeddings from language model (ELMo), pre-trained on PubMed abstracts.
☆32Dec 3, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sohampoddar26 / caves-data
View on GitHub
CAVES-dataset accepted at SIGIR'22
☆13Aug 9, 2024Updated last year
allenai / feb
View on GitHub
Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"
☆12Apr 27, 2022Updated 4 years ago
GanjinZero / BioBART
View on GitHub
BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model [ACL-BioNLP 2022]
☆52Oct 26, 2022Updated 3 years ago
Yuanhy1997 / Auto-Diagnosis-by-RL-and-Classification
View on GitHub
Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and Classification [AI in Medicine Journal]
☆14May 20, 2022Updated 4 years ago
kaishxu / DFMed
View on GitHub
Code and data for "Medical Dialogue Generation via Dual Flow Modeling" (ACL 2023 Findings)
☆14Nov 22, 2023Updated 2 years ago
mingze-yuan / Awesome-LLM-Healthcare
View on GitHub
The paper list of the review on LLMs in medicine - "Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assis…
☆269Dec 23, 2023Updated 2 years ago
iwangjian / pyloader
View on GitHub
🐳 PyLoader: An asynchronous Python dataloader for loading big datasets, supporting PyTorch and TensorFlow 2.x.
☆11Aug 29, 2021Updated 4 years ago
wjhou / Recap
View on GitHub
[EMNLP 2023 Findings] RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning
☆28Jun 12, 2025Updated last year
wangjs9 / CARE-master
View on GitHub
PyTorch implementation of CARE
☆16Oct 6, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
LuChang-CS / icd_hierarchical_structure
View on GitHub
The JSON file for the ICD-9-CM and ICD-10-CM hierarchy, including diagnosis codes and procedure codes
☆14Jan 26, 2023Updated 3 years ago
panushri25 / emrQA
View on GitHub
Code for the emrQA question answering dataset
☆153Feb 9, 2022Updated 4 years ago
omotolani12 / Building-an-Advanced-RAG-Chatbot-with-Knowledge-Graphs
View on GitHub
☆12Jun 12, 2024Updated 2 years ago
FutureForMe / MADKE
View on GitHub
☆14Jan 6, 2025Updated last year
sunlab-osu / CliniQG4QA
View on GitHub
CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering
☆23Feb 26, 2021Updated 5 years ago
Alibaba-NLP / EBM-Net
View on GitHub
Codes for the EMNLP'2020 paper "Predicting Clinical Trial Results by Implicit Evidence Integration".
☆14Jan 13, 2021Updated 5 years ago
UARK-AICV / FG-CXR
View on GitHub
The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…
☆12Jul 28, 2025Updated 11 months ago