PovertyAction / PII_detectionLinks
Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets.
☆45Updated 3 years ago
Alternatives and similar repositories for PII_detection
Users that are interested in PII_detection are comparing it to the libraries listed below
Sorting:
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆45Updated 6 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆58Updated last month
- CLK hash: hash pii for entity matching☆47Updated 3 weeks ago
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom…☆44Updated 10 months ago
- Library for identification, anonymization and de-anonymization of PII data☆22Updated 2 years ago
- Interactive notebooks containing demonstration code of the splink library☆38Updated last year
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆36Updated 4 years ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆23Updated 2 years ago
- ☆11Updated last year
- Fast, flexible name matching for large datasets☆72Updated 2 weeks ago
- A browser user interface for manual labeling of record pairs.☆47Updated last year
- A maximum-strength name parser for record linkage.☆37Updated last month
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️☆17Updated 3 weeks ago
- Abstractions for feature engineering on large graphs of tabular data.☆21Updated last week
- Tutorial code and data for the entity resolution workshops.☆45Updated 9 years ago
- A project to build a machine learning pipeline to detect personal identifiable information (PII)☆16Updated 2 years ago
- ☆26Updated 4 years ago
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- A small Python module containing quick utility functions for standard ETL processes.☆35Updated last month
- How to do data science with Optimus, Spark and Python.☆19Updated 5 years ago
- a convenient way to anonymize your data for analytics☆22Updated 3 years ago
- Using Jupyter notebook to develop DevOps automated environment to start and stop SageMaker notebook instances out of working hours☆22Updated 6 years ago
- Basic tutorial of using Apache Airflow☆36Updated 6 years ago
- Example Multi-Cycle, Multi-Touch Revenue and Cost Attribution Model☆27Updated last year
- NLP text recommendation system built in Python using Gensim, spaCy, and Plotly Dash☆15Updated 7 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆37Updated 5 years ago
- This project is wraper for Leilex, legal entity identifier API. Includes ISIN-LEI conversion. Search LEI number using company name.☆24Updated 8 months ago
- Record matching and entity resolution at scale in Spark☆34Updated last year
- Cohort extractor tool which can generate dummy data, or real data against OpenSAFELY-compliant research databases☆38Updated last month
- dbt adwords models☆18Updated 4 months ago