MIT-LCP / bloatectomy
A Python package for removing duplicate text in clinical notes or other documents
☆36Updated 4 years ago
Alternatives and similar repositories for bloatectomy:
Users who are interested in bloatectomy are comparing it to the libraries listed below:
- FlexIble Data-Driven pipeLinE – a preprocessing pipeline that transforms structured EHR data into feature vectors to be used with ML algo…☆91Updated 8 months ago
- ☆28Updated 3 years ago
- Quick script to parse out medications from discharge summaries in MIMIC format. Note that this approach uses minimal NLP, and can be vastly…☆28Updated 7 years ago
- Phe2vec: Automated Disease Phenotyping based on Unsupervised Embeddings from Electronic Health Records☆24Updated 4 years ago
- Python package for machine learning for healthcare using an OMOP common data model☆107Updated last year
- Schema definitions and Python types for Medical Event Data Standard, a standard for medical event data such as EHR and claims data☆46Updated last week
- A Python Natural Language Processing Toolkit for Medical Text Generation☆76Updated 4 months ago
- Weak supervision methods for extracting real world evidence from EHRs☆33Updated 4 years ago
- Biomedical concept relatedness benchmark sampled from electronic health records☆11Updated 2 years ago
- Large Language Models to Identify Social Determinants of Health in Electronic Health Records | Paper: https://www.nature.com/articles/s41…☆42Updated last year
- Deep Generative Modelling of Patient Timelines using Electronic Health Records☆55Updated last year
- ☆57Updated last year
- A simple set of MEDS polars-based ETL and transformation functions☆28Updated this week
- Patient Code & Text Representation Learning☆20Updated last year
- Code for BEHRT: Transformer for Electronic Health Records☆109Updated last year
- MIMIC (Medical Information Mart for Intensive Care) is a large, single-center database comprising information relating to patients admitt…☆76Updated 2 weeks ago
- Blog post on Medium☆48Updated 2 years ago
- ☆38Updated 2 years ago
- A collection of ETLs from common data formats to Medical Event Data Standard☆27Updated last week
- Clinical XLNet: Modeling Sequential Clinical Notes and Predicting Prolonged Mechanical Ventilation☆48Updated 3 years ago
- Implementation of Deep Patient Representation of Clinical Notes at Intensive Care Unit for Multi-Task Prediction☆32Updated 5 years ago
- The project was to build and release the first publicly available code evidence dataset called MDACE on a subset of the MIMIC-III clinica…☆26Updated last year
- Code for doing machine learning with various EHRs☆21Updated 2 years ago
- A comprehensive NLP preprocessing package for clinical notes sentence boundary detection, tokenization☆30Updated 9 months ago
- My heuristic script for sentence tokenization of mimic notes☆8Updated 7 years ago
- [ICLR 2025] ACES: Automatic Cohort Extraction System for Event-Streams☆29Updated this week
- Python library for graphical and continuous representations of ICD9 and ICD10 codes☆36Updated last month
- Experiments applying FIDDLE on MIMIC-III and eICU. https://doi.org/10.1093/jamia/ocaa139☆24Updated 2 years ago
- ☆38Updated 3 years ago
- public code repository for paper "Health system scale language models are general purpose clinical prediction engines"☆111Updated last year