Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets.
☆47Jan 7, 2026Updated 3 months ago
Alternatives and similar repositories for PII_detection
Users that are interested in PII_detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A CLI for identifying potential Personally Identifiable Information in datasets.☆14Apr 9, 2019Updated 7 years ago
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom…☆46Jan 1, 2026Updated 3 months ago
- This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire…☆273Mar 30, 2026Updated last week
- Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub☆338Jan 5, 2024Updated 2 years ago
- Search for PII in Python☆31Jan 29, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Library for identification, anonymization and de-anonymization of PII data☆22Dec 26, 2022Updated 3 years ago
- Open Privacy Vault - Secure, Performant, Open Source PII as a Service.☆51May 1, 2024Updated last year
- Testing some ideas in the Python playground.☆11Sep 23, 2022Updated 3 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro…☆29Jul 7, 2022Updated 3 years ago
- In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO,…☆11Jun 27, 2023Updated 2 years ago
- The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word pred…☆104Aug 13, 2024Updated last year
- A tampermonkey / greasemonkey tool to download Scridb.com content☆14Mar 30, 2022Updated 4 years ago
- Using Amazon Comprehend, Amazon Elasticsearch with Kibana, Amazon S3, Amazon Cognito to search over large number of documents.☆24May 8, 2024Updated last year
- ☆13Jan 28, 2024Updated 2 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Notes on time series forecasting☆16Mar 8, 2020Updated 6 years ago
- Asynchronous file handers for Python's logging☆15Jul 22, 2017Updated 8 years ago
- An application that open source projects can use to ensure they include relevant documentation (and not secrets or PII!)☆10Mar 29, 2021Updated 5 years ago
- this repo might get accepted☆28Feb 7, 2021Updated 5 years ago
- ☆12Dec 7, 2025Updated 4 months ago
- Strategies to deploy deep learning models☆27Jul 18, 2018Updated 7 years ago
- Stata module for fast wild bootstrap-based inference. Releases posted here are appropriate for use, and are usually posted promptly on SS…☆13Oct 8, 2025Updated 6 months ago
- Example Code to Supplement the Label Studio Blog☆33Jan 6, 2026Updated 3 months ago
- CLK hash: hash pii for entity matching☆48May 12, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Content for a talk on "The wonderful world of data quality tools in Python"☆18May 5, 2021Updated 4 years ago
- A scribd-downloader that actually works☆25Aug 17, 2017Updated 8 years ago
- Structural Time Series on US electricity demand data☆22Jan 12, 2021Updated 5 years ago
- Scripts to demonstrate VPC Service Controls between tenant and shared projects☆12Jun 11, 2019Updated 6 years ago
- Sniper. Passive Secrets Hunting.🚬☆13Jun 3, 2022Updated 3 years ago
- ☆11Nov 11, 2023Updated 2 years ago
- AWS Amplify project to demonstrate Amazon Connect Chat with realtime language detection and translation☆17Apr 2, 2026Updated last week
- In an effort to decrease the execution time of the OCR process, a multi-processing script was created using Python's multi-processing mod…☆10Dec 6, 2019Updated 6 years ago
- Install applications and development environment on a macOS or Linux machine.☆14Updated this week
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- SuiteCRM Docker Compose Script☆28Oct 9, 2025Updated 6 months ago
- Scanner to send specially crafted requests and catch callbacks of systems that are impacted by log4j log4shell vulnerability and to detec…☆12Feb 15, 2022Updated 4 years ago
- Cluster doctor skills☆14Feb 20, 2026Updated last month
- Open source software for machine learning production monitoring : maintain control over production models, detect bias, explain your resu…☆21Mar 3, 2023Updated 3 years ago
- A Python program to scrape secrets from GitHub through usage of a large repository of dorks.☆14Jan 28, 2021Updated 5 years ago
- Ansible role to install AWS EC2 Systems Manager☆14Dec 1, 2022Updated 3 years ago
- A tool to run nmap against each line in a script.☆17Jan 3, 2021Updated 5 years ago