data61 / clkhash
CLK hash: hash PII for entity matching
☆47 Updated 4 months ago
Alternatives and similar repositories for clkhash
Users that are interested in clkhash are comparing it to the libraries listed below
- Python implementation of anonymous linkage using cryptographic linkage keys ☆65 Updated last year
- Privacy Preserving Record Linkage Service ☆26 Updated 2 years ago
- A simple command line interface to the datamade/dedupe library. ☆42 Updated 2 years ago
- A browser user interface for manual labeling of record pairs. ☆47 Updated 2 years ago
- A maximum-strength name parser for record linkage. ☆38 Updated 2 weeks ago
- Python wrapper for a C++ Double Metaphone ☆15 Updated 3 weeks ago
- Distributed Bayesian Entity Resolution in Apache Spark ☆57 Updated 4 years ago
- Framework for processing data packages in pipelines of modular components. ☆121 Updated 2 months ago
- Open Source Proxy Demographic module written in Python ☆36 Updated last year
- Python implementations of record linkage blocking techniques. ☆21 Updated last year
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets ☆117 Updated 9 months ago
- Resources for tackling record linkage / deduplication / data matching problems ☆125 Updated last year
- Python I/O extras ☆18 Updated 2 years ago
- A workshop on data privacy methods for data scientists. ☆71 Updated 3 years ago
- A Python library for working with Data Packages. ☆191 Updated last year
- ☆74 Updated last year
- Python for people data ☆69 Updated last year
- Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets… ☆46 Updated 4 years ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration ☆36 Updated 5 years ago
- Dedupe/batch geocode addresses and venues around the world with libpostal ☆83 Updated 3 years ago
- ☆271 Updated last year
- Fast, flexible name matching for large datasets ☆72 Updated 2 weeks ago
- A hands-on tutorial showing how to use Python to do anonymisation with synthetic data ☆79 Updated 3 years ago
- A Python library for working with Table Schema. ☆264 Updated 10 months ago
- Interactive notebooks containing demonstration code of the splink library ☆39 Updated last year
- variations of the record linkage model of Steorts et al. AISTATS 2014's "SMERED: A Bayesian Approach to Graphical Record Linkage and De-d… ☆26 Updated 8 years ago
- python library for automated dataset normalization ☆116 Updated 2 years ago
- The SQL/Ibis powered sklearn of record linkage ☆19 Updated this week
- The ONS Big Data Team Github pages ☆10 Updated 4 years ago
- pyfpds is a python wrapper around the FPDS ATOM feed ☆13 Updated 6 years ago