Medical Question Answering Dataset of 47,457 QA pairs created from 12 NIH websites
☆443 Oct 17, 2023 Updated 2 years ago
Alternatives and similar repositories for MedQuAD
Users who are interested in MedQuAD are comparing it to the repositories listed below
- Challenge on Textual Inference and Question Entailment in the Medical Domain https://sites.google.com/view/mediqa2019 ☆52 Jan 27, 2023 Updated 3 years ago
- Multimodal Question Answering in the Medical Domain: A summary of Existing Datasets and Systems ☆317 Oct 17, 2023 Updated 2 years ago
- Medical Question-Answering datasets prepared for the TREC 2017 LiveQA challenge (Medical Task) ☆56 Nov 28, 2024 Updated last year
- Code for the emrQA question answering dataset ☆153 Feb 9, 2022 Updated 4 years ago
- The medical question entailment data introduced in the AMIA 2016 Paper (Recognizing Question Entailment for Medical Question Answering) ☆14 Jan 27, 2023 Updated 3 years ago
- The gold standard corpus for medication question answering introduced in the MedInfo 2019 paper (Bridging the Gap between Consumers’ Medi… ☆21 Dec 5, 2025 Updated 3 months ago
- Medical question and answer dataset gathered from the web. ☆125 Dec 9, 2020 Updated 5 years ago
- Dataset for medical question summarization introduced in the ACL 2019 paper "On the Summarization of Consumer Health Questions" (A. Ben A… ☆32 Jan 27, 2023 Updated 3 years ago
- Biomedical Question Answering Datasets. ☆126 Apr 30, 2025 Updated 10 months ago
- Machine reading comprehension on clinical case reports ☆151 Aug 28, 2025 Updated 6 months ago
- PubMedQA: A Dataset for Biomedical Research Question Answering ☆413 Apr 18, 2023 Updated 2 years ago
- ☆24 May 31, 2025 Updated 9 months ago
- BLUE benchmark consists of five different biomedicine text-mining tasks with ten corpora. ☆297 Jan 12, 2022 Updated 4 years ago
- A corpus of Biomedical papers annotated with mentions of UMLS entities. ☆344 Nov 9, 2021 Updated 4 years ago
- Visual Question Generation ☆11 Aug 20, 2024 Updated last year
- MedNLI - A Natural Language Inference Dataset For The Clinical Domain ☆133 Feb 15, 2023 Updated 3 years ago
- HBAM: Hierarchical Bi-directional Word Attention Model ☆90 Mar 24, 2023 Updated 2 years ago
- Code and data for MedQA ☆361 Dec 1, 2022 Updated 3 years ago
- Visual Question Answering in the Medical Domain VQA-Med 2019 ☆94 Jan 12, 2024 Updated 2 years ago
- Tools for curating biomedical training data for large-scale language modeling ☆495 Dec 9, 2024 Updated last year
- Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medic… ☆285 Oct 18, 2023 Updated 2 years ago
- ☆27 Dec 12, 2024 Updated last year
- Data and code from our "Inferring Which Medical Treatments Work from Reports of Clinical Trials", NAACL 2019. This work concerns inferrin… ☆65 Aug 31, 2021 Updated 4 years ago
- Word embeddings trained on medical subreddits. ☆10 Jan 4, 2021 Updated 5 years ago
- CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering ☆23 Feb 26, 2021 Updated 5 years ago
- A large-scale (194k), Multiple-Choice Question Answering (MCQA) dataset designed to address real-world medical entrance exam questions. ☆262 Nov 28, 2022 Updated 3 years ago
- Pre-trained Language Model for Biomedical Question Answering ☆125 Mar 24, 2023 Updated 2 years ago
- Stochastic Answer Networks (SAN) for Machine Reading Comprehension ☆149 Nov 26, 2018 Updated 7 years ago
- Code for Question Condensing Networks for Answer Selection in Community Question Answering ☆14 Aug 26, 2018 Updated 7 years ago
- Large Language-and-Vision Assistant for BioMedicine, built towards multimodal GPT-4 level capabilities. ☆10 Nov 29, 2023 Updated 2 years ago
- Labeled dataset of similar and dissimilar medical question pairs created by Curai's doctors ☆21 Aug 24, 2020 Updated 5 years ago
- System for Medical Concept Extraction and Linking ☆436 Aug 12, 2024 Updated last year
- Code and supplementary material for the HealthINF conference paper ☆13 Jan 19, 2021 Updated 5 years ago
- A near real-time named-entity recognizer ☆64 Feb 18, 2026 Updated last month
- [ACL 2019]: Interconnected Question Generation with Coreference Alignment and Conversation Flow Modeling ☆88 Apr 5, 2020 Updated 5 years ago
- Official Codes for "Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes" ☆114 Aug 22, 2024 Updated last year
- [NeurIPS'22] EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records ☆103 Mar 12, 2026 Updated last week
- PubMed PICO Element Detection Dataset ☆57 Jul 19, 2018 Updated 7 years ago
- Bioinformatics'2020: BioBERT: a pre-trained biomedical language representation model for biomedical text mining ☆2,191 Aug 13, 2023 Updated 2 years ago