A package to build an end-to-end pipeline for detecting personally identifiable information from text.
☆49 Jun 2, 2019 Updated 6 years ago
Alternatives and similar repositories for piidetect
Users that are interested in piidetect are comparing it to the libraries listed below.
- Application and Python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets… ☆46 Jan 7, 2026 Updated 3 months ago
- A project to build a machine learning pipeline to detect personally identifiable information (PII) ☆16 Dec 8, 2022 Updated 3 years ago
- Library for identification, anonymization and de-anonymization of PII data ☆22 Dec 26, 2022 Updated 3 years ago
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️ ☆18 Updated this week
- This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire… ☆273 Mar 30, 2026 Updated 2 weeks ago
- Registry of metadata identifier entities like UUID, GUID, person fullname, address and so on. Linked with other sources ☆18 Jan 1, 2026 Updated 3 months ago
- An iHub Summer 2015 project ☆10 Sep 7, 2015 Updated 10 years ago
- Repository for Data Engineering Zoomcamp 2024 ☆14 Mar 25, 2024 Updated 2 years ago
- Search for PII in Python ☆30 Jan 29, 2024 Updated 2 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro… ☆29 Jul 7, 2022 Updated 3 years ago
- Solution to set up a recurring Security Hub CSV full report with email notification to provide detailed report of the security posture. ☆23 Nov 11, 2025 Updated 5 months ago
- This repository contains comprehensive hands-on exercises designed to help you explore and master various AWS Application services. ☆11 Sep 28, 2023 Updated 2 years ago
- Using Amazon Comprehend, Amazon Elasticsearch with Kibana, Amazon S3, Amazon Cognito to search over large number of documents. ☆24 May 8, 2024 Updated last year
- ☆13 Jan 28, 2024 Updated 2 years ago
- Curated collection of DE1's favorite kedro pieces. ☆12 Apr 5, 2024 Updated 2 years ago
- ☆22 Oct 27, 2025 Updated 5 months ago
- Robust de-identification of medical notes using transformer architectures ☆59 Jun 27, 2022 Updated 3 years ago
- ☆38 Nov 13, 2025 Updated 5 months ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python. ➡️ The project has moved to: https://gitlab.opencode… ☆21 Mar 20, 2026 Updated 3 weeks ago
- KIND: an Italian Multi-Domain Dataset for Named Entity Recognition ☆14 Jun 28, 2023 Updated 2 years ago
- CLK hash: hash pii for entity matching ☆47 May 12, 2025 Updated 11 months ago
- Content for a talk on "The wonderful world of data quality tools in Python" ☆18 May 5, 2021 Updated 4 years ago
- ☆11 Jul 20, 2023 Updated 2 years ago
- Homework assignments for ISYE 6740 Computational Data Analysis (Spring 2022) ☆13 Sep 21, 2022 Updated 3 years ago
- R - Fetch, build and deploy. ☆12 Jul 25, 2023 Updated 2 years ago
- ☆11 Nov 11, 2023 Updated 2 years ago
- AWS Amplify project to demonstrate Amazon Connect Chat with realtime language detection and translation ☆17 Updated this week
- English-French MT dialogue dataset ☆17 Apr 29, 2022 Updated 3 years ago
- PyTorch implementation of the paper: Vector Projection Network for Few-shot Slot Tagging in Natural Language Understanding. Su Zhu, Ruish… ☆18 Nov 10, 2021 Updated 4 years ago
- A comprehensive tool for capturing performance metrics and workload snapshots, and generating in-depth comparison reports for Amazon Auro… ☆20 Apr 9, 2026 Updated last week
- CMU dictionary in IPA instead of their subset of Arpabet ☆16 Sep 24, 2024 Updated last year
- Lambda Chaos Engineering without changing code ☆13 Jan 8, 2025 Updated last year
- Copy data from Azure Blob Storage to Amazon S3 using code. View Azure costs using Amazon QuickSight ☆16 Mar 5, 2026 Updated last month
- This repo contains all the topics that will help you to pass the examination ☆18 Oct 23, 2022 Updated 3 years ago
- ☆24 Jan 10, 2024 Updated 2 years ago
- Source code for 'Pro Spark Streaming' by Zubair Nabi ☆11 Mar 27, 2017 Updated 9 years ago
- ☆12 Aug 5, 2024 Updated last year
- An attempt to develop standards for PII redaction. ☆17 Mar 9, 2021 Updated 5 years ago
- Utility for generating HTML elements with tagged `template literal`. Only 649 bytes. ☆12 Sep 25, 2024 Updated last year