A package to build an end-to-end pipeline for detecting personally identifiable information from text.
☆49Jun 2, 2019Updated 6 years ago
Alternatives and similar repositories for piidetect
Users that are interested in piidetect are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…☆46Jan 7, 2026Updated 2 months ago
- A project to build a machine learning pipeline to detect personal identifiable information (PII)☆16Dec 8, 2022Updated 3 years ago
- Library for identification, anonymization and de-anonymization of PII data☆22Dec 26, 2022Updated 3 years ago
- A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)☆98Feb 15, 2026Updated last month
- A Python package to scrub PII☆24Apr 21, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A personally identifiable information (PII) filter.☆10May 28, 2021Updated 4 years ago
- This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire…☆268Mar 2, 2026Updated 3 weeks ago
- Repository for Data Engineering Zoomcamp 2024☆14Mar 25, 2024Updated 2 years ago
- Search for PII in Python☆31Jan 29, 2024Updated 2 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro…☆29Jul 7, 2022Updated 3 years ago
- Solution to setup a recurring Security Hub CSV full report with email notification to provide detailed report of the security posture.☆23Nov 11, 2025Updated 4 months ago
- Using Amazon Comprehend, Amazon Elasticsearch with Kibana, Amazon S3, Amazon Cognito to search over large number of documents.☆24May 8, 2024Updated last year
- Curated collection of DE1's favorite kedro pieces.☆12Apr 5, 2024Updated last year
- An application that open source projects can use to ensure they include relevant documentation (and not secrets or PII!)☆10Mar 29, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Robust de-identification of medical notes using transformer architectures☆59Jun 27, 2022Updated 3 years ago
- A list of Presto/Trino resources☆23Aug 7, 2023Updated 2 years ago
- Open Privacy Vault - Secure, Performant, Open Source PII as a Service.☆51May 1, 2024Updated last year
- CLK hash: hash pii for entity matching☆48May 12, 2025Updated 10 months ago
- Content for a talk on "The wonderful world of data quality tools in Python"☆18May 5, 2021Updated 4 years ago
- Launch different bash configurations for Linux vs OSX, interactive vs batch☆40Aug 11, 2012Updated 13 years ago
- R - Fetch, build and deploy.☆12Jul 25, 2023Updated 2 years ago
- A Forge based Minecraft server-side plugin API☆13Nov 23, 2014Updated 11 years ago
- Hands on advanced machine learning for information extraction from tweets tasks, data, and open source tools☆14Apr 14, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A library for simple communication with Google Cloud Storage.☆12Feb 12, 2018Updated 8 years ago
- ☆15Aug 2, 2024Updated last year
- English-French MT dialogue dataset☆17Apr 29, 2022Updated 3 years ago
- The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word pred…☆104Aug 13, 2024Updated last year
- ☆12Jan 25, 2024Updated 2 years ago
- PyTorch implementation of the paper: Vector Projection Network for Few-shot Slot Tagging in Natural Language Understanding. Su Zhu, Ruish…☆18Nov 10, 2021Updated 4 years ago
- A PDM plugin to sync the exported files with the project file☆15Sep 6, 2025Updated 6 months ago
- A comprehensive tool for capturing performance metrics and workload snapshots, and generating in-depth comparison reports for Amazon Auro…☆19Mar 6, 2026Updated 2 weeks ago
- How to really install tensorflow-gpu from source on a clean instance of Ubuntu☆11Sep 29, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- CMU dictionary in IPA instead of their subset of Arpabet☆16Sep 24, 2024Updated last year
- ☆28Mar 4, 2026Updated 3 weeks ago
- This is the official code used for WAT 2017 Description Paper titled A Bag of Useful Tricks for Practical Neural Machine Translation: Emb…☆12Oct 24, 2017Updated 8 years ago
- ☆17Jan 14, 2013Updated 13 years ago
- Copy data from Azure Blob Storage to Amazon S3 using code. View Azure costs using Amazon QuickSight☆16Mar 5, 2026Updated 3 weeks ago
- Low poly Unity platformer. Going to get my third person camera and controller tweaked then do small mechanics.☆11Jul 18, 2015Updated 10 years ago
- A command-line tool that summarizes the size of a codebase by language, showing lines of code with and without comments and blank lines.☆51Mar 6, 2026Updated 3 weeks ago