microsoft / clinical_visit_note_summarization_corpus
A corpus of textual data from synthetic clinical encounters, including each encounter's dialogue transcript and clinical notes.
☆28 Updated 11 months ago
Related projects:
- Self-verification for LLMs. ☆60 Updated last year
- Sentence tokenizer for clinical/medical text. ☆25 Updated 3 months ago
- A Python Natural Language Processing Toolkit for Medical Text Generation ☆66 Updated this week
- A new collection of 1.7k doctor-patient conversations and corresponding clinical notes/summaries. ☆49 Updated 11 months ago
- A deidentifier / deidentification pipeline developed by Stanford and Penn as part of the MIDRC organization. ☆75 Updated 3 months ago
- The project was to build and release the first publicly available code evidence dataset called MDACE on a subset of the MIMIC-III clinica… ☆21 Updated last year
- A Python Natural Language Processing Toolkit for Electronic Health Record Texts ☆12 Updated last year
- Transformers for Clinical NLP ☆21 Updated last week
- MedAlign is a clinician-generated dataset for instruction following with electronic medical records. ☆87 Updated 11 months ago
- Dataset for the NLPMC @ NAACL 2021 Paper: Assertion Detection in Clinical Notes: Medical Language Models to the Rescue? ☆15 Updated 2 years ago
- Official Codes for "Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes" ☆84 Updated 3 weeks ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents ☆21 Updated 2 years ago
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare… ☆18 Updated 5 months ago
- BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance. ☆42 Updated 7 months ago
- Labeled dataset of similar and dissimilar medical question pairs created by Curai's doctors ☆19 Updated 4 years ago
- Medical reasoning using large language models ☆83 Updated 8 months ago
- We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts… ☆91 Updated last month
- Clinical text summarization by adapting large language models ☆111 Updated last month
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus ☆14 Updated last year
- GPTNERMED is a language model-generated, synthetic dataset and an open neural NER model for medical entities designed for German data. ☆15 Updated 11 months ago
- Medical Question-Answering datasets prepared for the TREC 2017 LiveQA challenge (Medical Task) ☆39 Updated last year
- Apache cTAKES is a Natural Language Processing (NLP) platform for clinical text. ☆46 Updated this week
- Biomedical Question Answering Datasets. ☆71 Updated last year
- OLAPH: Improving Factuality in Biomedical Long-form Question Answering ☆34 Updated last week
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆67 Updated 2 months ago
- For Med-Gemini, we relabeled the MedQA benchmark; this repo includes the annotations and analysis code. ☆24 Updated 3 months ago