MIT-LCP / bloatectomy
A python package for removing duplicate text in clinical notes or other documents
☆36Updated 4 years ago
Alternatives and similar repositories for bloatectomy:
Users that are interested in bloatectomy are comparing it to the libraries listed below
- FlexIble Data-Driven pipeLinE – a preprocessing pipeline that transforms structured EHR data into feature vectors to be used with ML algo…☆92Updated 9 months ago
- ☆29Updated 3 years ago
- Quick script to parse out medications from discharge summaries in MIMIC format. Not that this approach uses minimal NLP, and can be vatly…☆29Updated 7 years ago
- Patient Code & Text Representation Learning☆20Updated last year
- Weak supervision methods for extracting real world evidence from EHRs☆33Updated 4 years ago
- Schema definitions and Python types for Medical Event Data Standard, a standard for medical event data such as EHR and claims data☆50Updated 2 weeks ago
- Blog post on Medium☆49Updated 2 years ago
- Code for BEHRT: Transformer for Electronic Health Records☆109Updated last year
- Phe2vec: Automated Disease Phenotyping based on Unsupervised Embeddings from Electronic Health Records☆24Updated 4 years ago
- ☆59Updated last year
- ☆38Updated 3 years ago
- Code for doing machine learning with various EHRs☆21Updated 2 years ago
- A Python Natural Language Processing Toolkit for Medical Text Generation☆76Updated 5 months ago
- A simple set of MEDS polars-based ETL and transformation functions☆29Updated this week
- My heuristic script for sentence tokenization of mimic notes☆8Updated 7 years ago
- A collection of ETLs from common data formats to Medical Event Data Standard☆29Updated last week
- MIMIC (Medical Information Mart for Intensive Care) is a large, single-center database comprising information relating to patients admitt…☆76Updated last week
- A comprehensive NLP preprocessing package for clinical notes sentence boundary detection, tokenization☆30Updated 10 months ago
- ☆22Updated 3 years ago
- Targeted-BEHRT: Deep Learning for Observational Causal Inference on Longitudinal Electronic Health Records☆20Updated 2 years ago
- Dataset containing 7,025 discharge summary notes from the MIMIC III dataset annotated for 7 SBDHs☆14Updated 2 years ago
- Large Language Models to Identify Social Determinants of Health in Electronic Health Records | Paper: https://www.nature.com/articles/s41…☆43Updated last year
- Mapping the MIMIC-III database to the OMOP schema☆132Updated last year
- Deep Generative Modelling of Patient Timelines using Electronic Health Records☆57Updated last year
- Installing MIMIC-IV in a local Postgres database☆40Updated 3 years ago
- Implementation of Deep Patient Representation of Clinical Notes at Intensive Care Unit for Multi-Task Prediction☆32Updated 5 years ago
- Python package for machine learning for healthcare using a OMOP common data model☆108Updated last year
- circEWS public code☆67Updated 5 months ago
- AmsterdamUMCdb - Freely Accessible ICU database. Please access our Open Access manuscript at https://doi.org/10.1097/CCM.0000000000004916☆173Updated 2 months ago
- The project was to build and release the first publicly available code evidence dataset called MDACE on a subset of the MIMIC-III clinica…☆27Updated last year