A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)
☆98Feb 15, 2026Updated 2 weeks ago
Alternatives and similar repositories for pii-codex
Users that are interested in pii-codex are comparing it to the libraries listed below
Sorting:
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆49Jun 2, 2019Updated 6 years ago
- Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…☆46Jan 7, 2026Updated 2 months ago
- Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub☆338Jan 5, 2024Updated 2 years ago
- Robust de-identification of medical notes using transformer architectures☆58Jun 27, 2022Updated 3 years ago
- This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire…☆266Jan 6, 2026Updated 2 months ago
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom…☆46Jan 1, 2026Updated 2 months ago
- Next generation compute platform for the post-modern data stack☆25Feb 27, 2026Updated last week
- Measuring and Controlling Persona Drift in Language Model Dialogs☆21Feb 26, 2024Updated 2 years ago
- In browser active learning and guided search☆17May 6, 2023Updated 2 years ago
- Streamlit application to explore Snowflake Tables☆50Oct 28, 2023Updated 2 years ago
- Infraless Database over any s3 storage API.☆21Mar 23, 2024Updated last year
- Qdrant operator creates and manages Qdrant clusters running in Kubernetes☆24Apr 10, 2024Updated last year
- An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data…☆7,068Updated this week
- The sane way of building a data layer in Airflow☆24Dec 5, 2019Updated 6 years ago
- Using Amazon Comprehend, Amazon Elasticsearch with Kibana, Amazon S3, Amazon Cognito to search over large number of documents.☆24May 8, 2024Updated last year
- Cloudformation and SQL scripts used to replicate a POC environment from the "Data Lake to Data Warehouse: Enhancing Customer 360 with Ama…☆31Feb 20, 2020Updated 6 years ago
- A Python package to scrub PII☆24Apr 21, 2023Updated 2 years ago
- Terraform module which creates Snowflake RBAC resources using a simple configuration model. DISCLAIMER: Please see the following module t…☆12Jul 3, 2023Updated 2 years ago
- ☆24Feb 2, 2026Updated last month
- [Findings of ACL 2022] Meta-Path Guided Contrastive Learning for Logical Reasoning of Text☆28Mar 21, 2022Updated 3 years ago
- A Pub/Sub for Tables based data integration platform, to discover, publish, modify and consume data effortlessly.☆38Feb 25, 2026Updated last week
- Search for PII in Python☆31Jan 29, 2024Updated 2 years ago
- This Repo contains all the topics that will help you to pass the examination☆18Oct 23, 2022Updated 3 years ago
- A local first persistent log☆36Sep 14, 2025Updated 5 months ago
- ☆14Aug 2, 2024Updated last year
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- Open source project to help the Web3 community fight frauds and scams.☆18Feb 7, 2024Updated 2 years ago
- Simple example on how to use AWS Step Functions, to integrate multiple AWS Lambda functions.☆35Oct 18, 2018Updated 7 years ago
- Automated Continuous Data Quality Measurement☆12Nov 15, 2023Updated 2 years ago
- A comprehensive tool for capturing performance metrics and workload snapshots, and generating in-depth comparison reports for Amazon Auro…☆19Updated this week
- Simple python script that converts all Excel files (xls, xlsx, xlsm, csv) in a directory into xlsb files.☆10Mar 13, 2023Updated 2 years ago
- Finds linguistic patterns effortlessly☆39Aug 29, 2023Updated 2 years ago
- ☆17Apr 4, 2025Updated 11 months ago
- CSC 424 Advanced Database Management Systems☆16Jan 1, 2020Updated 6 years ago
- Prosimos Simulation Engine (CLI)☆10Dec 1, 2025Updated 3 months ago
- Code for the paper "Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching" (COLING 2025)☆19Jan 3, 2026Updated 2 months ago
- ☆14Feb 19, 2024Updated 2 years ago
- Provides the spine and skeleton framework for the WSU Web in WordPress☆10Feb 16, 2024Updated 2 years ago
- Copy data from Azure Blob Storage to Amazon S3 using code. View Azure costs using Amazon QuickSight☆16Feb 23, 2026Updated last week