PovertyAction / PII_detection
Application and Python script to identify, remove, and/or recode personally identifiable information (PII) in field experiment datasets.
☆44Updated 3 years ago
Alternatives and similar repositories for PII_detection:
Users who are interested in PII_detection are comparing it to the libraries listed below.
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆44Updated 5 years ago
- Interactive notebooks containing demonstration code of the splink library☆38Updated last year
- CLK hash: hash PII for entity matching☆47Updated 2 weeks ago
- A research Python package for detecting, categorizing, and assessing the severity of personally identifiable information (PII)☆85Updated last year
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆57Updated 3 months ago
- ☆12Updated last year
- MirrorDataGenerator is a Python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆21Updated 2 years ago
- Library for identification, anonymization and de-anonymization of PII data☆22Updated 2 years ago
- An infrastructure as code approach to deploying Snowflake using Terraform☆25Updated last year
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom…☆44Updated 8 months ago
- Demo of DuckDB with dashboarding tools: Evidence, Streamlit, and Rill☆16Updated last year
- Entity resolution, also known as data matching or record linkage, is the task of finding records in a data set that refer to the same or similar real…☆23Updated 5 months ago
- dotML is a light-weight semantic layer written in Python.☆34Updated last year
- A hands-on tutorial showing how to use Python to do anonymisation with synthetic data☆79Updated 2 years ago
- This project is a wrapper for Leilex, a legal entity identifier (LEI) API. Includes ISIN-LEI conversion and LEI number search by company name.☆24Updated 5 months ago
- ☆37Updated last month
- Example scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask, and synthesize PII in text.☆80Updated last month
- Fully unit tested utility functions for data engineering. Python 3 only.☆15Updated 7 months ago
- ☆26Updated 4 years ago
- ☆43Updated last year
- Using the Parquet file format with Python☆15Updated last year
- ☆20Updated 4 years ago
- A small Python module containing quick utility functions for standard ETL processes.☆34Updated this week
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker☆31Updated 3 years ago
- Code examples for the Introduction to Kubeflow course☆14Updated 4 years ago
- A convenient way to anonymize your data for analytics☆22Updated 3 years ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro…☆27Updated 2 years ago
- Example setup for dbt Cloud using GitHub integrations☆11Updated 5 years ago
- Slides, videos and other potentially useful artifacts from various presentations on responsible machine learning.☆22Updated 5 years ago