A Python library to de-identify medical records with state-of-the-art NLP methods.
☆147Nov 17, 2025Updated 6 months ago
Alternatives and similar repositories for deidentify
Users that are interested in deidentify are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Robust de-identification of medical notes using transformer architectures☆60Jun 27, 2022Updated 3 years ago
- Deduce: de-identification method for Dutch medical text☆65Feb 10, 2026Updated 3 months ago
- init☆13Dec 4, 2024Updated last year
- Tools for de-identifying medical records on Google Cloud Platform.☆50Jan 23, 2020Updated 6 years ago
- Medical Text Mining and Information Extraction with spaCy☆437Nov 1, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- This is an R package that implements a library of standard queries that run against the OMOP-CDM.☆18Jun 7, 2024Updated last year
- Library for clinical NLP with spaCy.☆653Mar 31, 2026Updated last month
- System for Medical Concept Extraction and Linking☆442Aug 12, 2024Updated last year
- ☆17Apr 22, 2026Updated 3 weeks ago
- A deidentifier / deidentification pipeline developed by Stanford and Penn as part of the MIDRC organization.☆106Nov 24, 2025Updated 5 months ago
- ☆11Nov 19, 2020Updated 5 years ago
- Code to study the generalisability of benchmark models on non-stationary EHRs.☆15Aug 7, 2019Updated 6 years ago
- Transformers for Clinical NLP☆29Apr 17, 2026Updated last month
- 👥 An R package for deidentifying datasets that may contain personally identifiable information (PII)☆30Feb 12, 2019Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆223Dec 11, 2024Updated last year
- Clinical NLP workshop for ODSC☆41Oct 31, 2019Updated 6 years ago
- A Python Natural Language Processing Toolkit for Medical Text Generation☆84May 24, 2025Updated 11 months ago
- Anonymize Medical Documents using LLMs☆68Sep 2, 2024Updated last year
- MedType: Improving Medical Entity Linking with Semantic Type Prediction☆114Feb 10, 2023Updated 3 years ago
- Bidirectional Encoder Representations from Transformers (BERT) transfer learning for named entity recognition and de-identification of se…☆10Aug 3, 2019Updated 6 years ago
- spaCy pipeline object for negating concepts in text☆282Apr 20, 2026Updated last month
- Clinical Natural Language Processing using spaCy, scispacy, and medspacy☆102Apr 24, 2024Updated 2 years ago
- Code and Word2Vec embeddings of LOINC codes for KDD 2019 DSHealth paper "Evaluation of Embeddings of Laboratory Test Codes for Patients a…☆11Jun 13, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- GERNERMED++ is a transfer-learning-based open neural NER model for medical entities designed for German data.☆10Oct 20, 2023Updated 2 years ago
- Open source clinical text de-identification☆155Aug 20, 2024Updated last year
- A full spaCy pipeline and models for scientific/biomedical documents.☆1,958Dec 4, 2025Updated 5 months ago
- A curated list of awesome resources at the intersection of healthcare and AI☆73Sep 22, 2023Updated 2 years ago
- Privacy-preserving representations of training data for de-identification☆17Apr 21, 2026Updated last month
- Medical Concept Annotation Tool☆531Jul 25, 2025Updated 9 months ago
- A corpus of Biomedical papers annotated with mentions of UMLS entities.☆344Nov 9, 2021Updated 4 years ago
- A web-based version of the codebook, which generates a concise summary of every variable in a dataset.☆14Apr 9, 2022Updated 4 years ago
- This is a repository for annotation data for the THYME Project, a clinical natural language processing project dedicated to extracting us…☆36May 21, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks☆34Mar 12, 2024Updated 2 years ago
- [Under development] Assessment of Pre-trained Observational Large Language-models in OHDSI (APOLLO)☆14Jun 18, 2024Updated last year
- Oncology Working Group Repository☆62May 8, 2026Updated 2 weeks ago
- ☆22Dec 8, 2022Updated 3 years ago
- Code for doing machine learning with various EHRs☆22Mar 1, 2023Updated 3 years ago
- FlexIble Data-Driven pipeLinE – a preprocessing pipeline that transforms structured EHR data into feature vectors to be used with ML algo…☆100Mar 19, 2026Updated 2 months ago
- ☆18Oct 30, 2020Updated 5 years ago