ExpertOpsAI / DataHub
A central repository for curating and managing diverse datasets used in healthcare applications.
☆11 Updated last year
Alternatives and similar repositories for DataHub
Users who are interested in DataHub are comparing it to the repositories listed below
- MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents ☆189 Updated last month
- A collection of AI models tailored for healthcare applications. ☆46 Updated last year
- Code repository for the framework to engage in clinical decision making task using the MIMIC-CDM dataset. ☆46 Updated 10 months ago
- Large Language Models to Identify Social Determinants of Health in Electronic Health Records | Paper: https://www.nature.com/articles/s41… ☆48 Updated last year
- [NeurIPS 2024 D&B] Official code for "EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge Summaries" ☆37 Updated 11 months ago
- Prompting Large Language Models for Zero-Shot Clinical Prediction with Structured Longitudinal Electronic Health Record Data ☆27 Updated last year
- Reading list for multimodal learning in healthcare ☆11 Updated 2 years ago
- Code and data for TrialGPT. ☆138 Updated 11 months ago
- A benchmark for few-shot evaluation of foundation models for electronic health records (EHRs) ☆207 Updated 6 months ago
- ☆31 Updated 3 years ago
- Deep Generative Modelling of Patient Timelines using Electronic Health Records ☆69 Updated 6 months ago
- A novel medical large language model family with 13/70B parameters, which have SOTA performances on various medical tasks ☆165 Updated 11 months ago
- A curated collection of cutting-edge research at the intersection of machine learning and healthcare. This repository will be actively ma… ☆34 Updated 8 months ago
- public code repository for paper "Health system scale language models are general purpose clinical prediction engines" ☆120 Updated 2 years ago
- ☆46 Updated 2 years ago
- Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records ☆24 Updated last year
- [npj Digital Medicine 2025] Multiple Embedding Model for EHR (MEME) used for strong prediction on Emergency Department tasks ☆18 Updated 5 months ago
- MedAlign is a clinician-generated dataset for instruction following with electronic medical records. ☆98 Updated 7 months ago
- Repo about the MultiCaRe Dataset, with demo notebooks and details about how it was created. ☆69 Updated 2 months ago
- Expert-Curated Oncology Reports to Advance Language Model Inference ☆30 Updated last year
- Official Codes for "Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes" ☆112 Updated last year
- ☆99 Updated 5 months ago
- Code repository to create the MIMIC-CDM Dataset. ☆41 Updated 10 months ago
- This project implements a RAG (Retrieval-Augmented Generation) system using an open-source stack. It utilizes BioMistral 7B as the main m… ☆36 Updated last year
- Medical Hallucination in Foundation Models and Their Impact on Healthcare (2025) ☆76 Updated 2 months ago
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records ☆118 Updated last year
- Clinical text summarization by adapting large language models ☆154 Updated last year
- Official implementation for NeurIPS'24 paper: MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making ☆221 Updated last year
- Med-BERT, contextualized embedding model for structured EHR data ☆311 Updated last year
- Curated papers on Large Language Models in Healthcare and Medical domain ☆378 Updated 7 months ago