edwardcooper/piidetect

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/edwardcooper/piidetect)

edwardcooper / piidetect

A package to build an end-to-end pipeline for detecting personally identifiable information from text.

☆50

Alternatives and similar repositories for piidetect

Users that are interested in piidetect are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

PovertyAction / PII_detection
View on GitHub
Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…
☆47Jan 7, 2026Updated 6 months ago
EdyVision / pii-codex
View on GitHub
A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)
☆101Feb 15, 2026Updated 5 months ago
fvaleye / metadata-guardian
View on GitHub
Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️
☆18Updated this week
apicrafter / metacrafter
View on GitHub
Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom…
☆46Jul 8, 2026Updated 2 weeks ago
Poogles / piiregex
View on GitHub
Search for PII in Python
☆29Jan 29, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
aws-samples / amazon-connect-data-analytics-sample
View on GitHub
☆21Oct 17, 2023Updated 2 years ago
aws-samples / aws-amplify-cloud-assistant-app
View on GitHub
☆13Jan 28, 2024Updated 2 years ago
dataengineerone / de1-python
View on GitHub
Curated collection of DE1's favorite kedro pieces.
☆12Apr 5, 2024Updated 2 years ago
librespacefoundation / upsat-comms-software
View on GitHub
COMMS Software for UPSat
☆12Dec 17, 2018Updated 7 years ago
obi-ml-public / ehr_deidentification
View on GitHub
Robust de-identification of medical notes using transformer architectures
☆63Jun 27, 2022Updated 4 years ago
openredact / nerwhal
View on GitHub
This is a prototype of a multi-lingual suite for named-entity recognition in Python. ➡️ The project has moved to: https://gitlab.opencode…
☆21Mar 20, 2026Updated 4 months ago
apicrafter / datacrafter
View on GitHub
NoSQL extract, transform, load (ETL) toolkit with Python
☆16Jul 17, 2026Updated last week
data61 / clkhash
View on GitHub
CLK hash: hash pii for entity matching
☆47May 12, 2025Updated last year
kxcloud / gradient-routing
View on GitHub
☆11Dec 4, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
spbail / data-quality-tools
View on GitHub
Content for a talk on "The wonderful world of data quality tools in Python"
☆18May 5, 2021Updated 5 years ago
rahulblr2016 / AWS-Certified-Solutions-Architect---Associate
View on GitHub
This Repo contains all the topics that will help you to pass the examination
☆18Oct 23, 2022Updated 3 years ago
tdoehmen / gitschemas
View on GitHub
☆11Jul 20, 2023Updated 3 years ago
dustinlacewell / polybrain.el
View on GitHub
Polymode support for Org-brain
☆14May 10, 2020Updated 6 years ago
inahpatrizia / isye_6740
View on GitHub
Homework assignments for ISYE 6740 Computational Data Analysis (Spring 2022)
☆14Sep 21, 2022Updated 3 years ago
aws-samples / aws-data-exporter
View on GitHub
☆11Nov 11, 2023Updated 2 years ago
veritross / studiosr
View on GitHub
PyTorch library to accelerate super-resolution research
☆11Jun 23, 2024Updated 2 years ago
tychovdo / noethers-razor
View on GitHub
Code for NeurIPS 2024 paper: "Noether's razor: Learning Conserved Quantities" by Tycho F. A. van der Ouderaa, Mark van der Wilk, Pim de H…
☆11Oct 12, 2024Updated last year
sz128 / few_shot_slot_tagging_and_NER
View on GitHub
PyTorch implementation of the paper: Vector Projection Network for Few-shot Slot Tagging in Natural Language Understanding. Su Zhu, Ruish…
☆18Nov 10, 2021Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
OsamaJBR / teach-me-aws-stepfunctions
View on GitHub
Simple example on how to use AWS Step Functions, to integrate multiple AWS Lambda functions.
☆35Oct 18, 2018Updated 7 years ago
aws-samples / serverless-chaos-extension
View on GitHub
Lambda Chaos Engineering without changing code
☆12Jan 8, 2025Updated last year
pdm-project / pdm-autoexport
View on GitHub
A PDM plugin to sync the exported files with the project file
☆15Sep 6, 2025Updated 10 months ago
business-science / shinyauth
View on GitHub
Dockerfile
☆10Feb 5, 2024Updated 2 years ago
ShuguangSun / python-view-data
View on GitHub
View data in Python
☆13Apr 13, 2024Updated 2 years ago
grettke / maccadet
View on GitHub
Use all of Emacs modifiers on macOS with various keyboards.
☆11Jul 22, 2024Updated 2 years ago
google-github-actions / github-workflow-job-to-pubsub
View on GitHub
Fulfills a GitHub workflow_job webhooks into a Pub/Sub queue.
☆12Mar 13, 2025Updated last year
Paulescu / text-embedding-evaluation
View on GitHub
Join 15k builders to the Real-World ML Newsletter ⬇️⬇️⬇️
☆19Apr 19, 2024Updated 2 years ago
dustinlacewell / org-spacer.el
View on GitHub
Enforce the number of blank lines between elements in an org-mode document
☆18Mar 24, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
aws-samples / natural-language-querying-of-data-in-s3-with-athena-and-generative-ai-text-to-sql
View on GitHub
☆15Aug 2, 2024Updated last year
Apress / pro-spark-streaming
View on GitHub
Source code for 'Pro Spark Streaming' by Zubair Nabi
☆11Mar 27, 2017Updated 9 years ago
aws-samples / deploy-datahub-using-aws-managed-services-ingest-metadata
View on GitHub
☆12Aug 5, 2024Updated last year
yogihbti / ccfdHMM
View on GitHub
Credit Card Fraud Detection using HMM ( Hidden Markow Model)
☆12Nov 2, 2017Updated 8 years ago
aws-samples / amazon-connect-global-resiliency
View on GitHub
Starter project to create a dashboard to interact with Amazon Connect Global Resiliency APIs
☆15Feb 20, 2024Updated 2 years ago
DanielSWolf / wiki-pronunciation-dict
View on GitHub
Pronunciation dictionaries for several languages, based on Wiktionary data.
☆21Nov 28, 2021Updated 4 years ago
aws-samples / amazon-textract-analyze-expense-processing-pipeline
View on GitHub
☆15Jul 14, 2026Updated last week