MIT-LCP / bloatectomyLinks
A python package for removing duplicate text in clinical notes or other documents
☆36Updated 4 years ago
Alternatives and similar repositories for bloatectomy
Users that are interested in bloatectomy are comparing it to the libraries listed below
Sorting:
- FlexIble Data-Driven pipeLinE – a preprocessing pipeline that transforms structured EHR data into feature vectors to be used with ML algo…☆93Updated 11 months ago
- Quick script to parse out medications from discharge summaries in MIMIC format. Not that this approach uses minimal NLP, and can be vatly…☆29Updated 7 years ago
- ☆29Updated 3 years ago
- ☆38Updated 3 years ago
- ☆61Updated last year
- Weak supervision methods for extracting real world evidence from EHRs☆33Updated 5 years ago
- A collection of ETLs from common data formats to Medical Event Data Standard☆29Updated last month
- Code for BEHRT: Transformer for Electronic Health Records☆110Updated 2 years ago
- Python package for machine learning for healthcare using a OMOP common data model☆108Updated last year
- Patient Code & Text Representation Learning☆20Updated 2 years ago
- Phe2vec: Automated Disease Phenotyping based on Unsupervised Embeddings from Electronic Health Records☆24Updated 4 years ago
- Large Language Models to Identify Social Determinants of Health in Electronic Health Records | Paper: https://www.nature.com/articles/s41…☆45Updated last year
- Schema definitions and Python types for Medical Event Data Standard, a standard for medical event data such as EHR and claims data☆62Updated 3 weeks ago
- Blog post on Medium☆49Updated 2 years ago
- Targeted-BEHRT: Deep Learning for Observational Causal Inference on Longitudinal Electronic Health Records☆20Updated 2 years ago
- MIMIC (Medical Information Mart for Intensive Care) is a large, single-center database comprising information relating to patients admitt…☆78Updated last week
- Implementation of Deep Patient Representation of Clinical Notes at Intensive Care Unit for Multi-Task Prediction☆32Updated 5 years ago
- My heuristic script for sentence tokenization of mimic notes☆8Updated 7 years ago
- ☆22Updated 2 years ago
- python library for graphical and continuous representations of ICD9 and ICD10 codes☆39Updated 4 months ago
- Dataset containing 7,025 discharge summary notes from the MIMIC III dataset annotated for 7 SBDHs☆14Updated 2 years ago
- Transformers for Clinical NLP☆25Updated 2 weeks ago
- A comprehensive NLP preprocessing package for clinical notes sentence boundary detection, tokenization☆31Updated last year
- Python module to parse UMLS source files☆19Updated 2 years ago
- Code for doing machine learning with various EHRs☆21Updated 2 years ago
- Deep Generative Modelling of Patient Timelines using Electronic Health Records☆62Updated last year
- Weakly supervised medical named entity classification☆73Updated 2 years ago
- ☆98Updated 3 years ago
- ☆22Updated 3 years ago
- A Python Natural Language Processing Toolkit for Medical Text Generation☆79Updated last week