data61 / clkhash
CLK hash: hash PII for entity matching
☆47 · Updated last week
Alternatives and similar repositories for clkhash:
Users who are interested in clkhash are comparing it to the libraries listed below:
- Python implementation of anonymous linkage using cryptographic linkage keys ☆65 · Updated 10 months ago
- Privacy Preserving Record Linkage Service ☆26 · Updated 2 years ago
- Python implementations of record linkage blocking techniques. ☆20 · Updated last year
- A simple command line interface to the datamade/dedupe library. ☆42 · Updated 2 years ago
- A maximum-strength name parser for record linkage. ☆36 · Updated last month
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018). ☆14 · Updated 6 years ago
- A browser user interface for manual labeling of record pairs. ☆45 · Updated last year
- Python wrapper for a C++ Double Metaphone ☆15 · Updated 2 years ago
- Framework for processing data packages in pipelines of modular components. ☆121 · Updated last month
- Distributed Bayesian Entity Resolution in Apache Spark ☆57 · Updated 3 years ago
- Resources for tackling record linkage / deduplication / data matching problems ☆122 · Updated last year
- A Python package to create a database on the platform using our moj data warehousing framework ☆21 · Updated 6 months ago
- Ansible Superset Role ☆16 · Updated 4 years ago
- Convert a CSV to a parquet file. ☆64 · Updated 2 years ago
- variations of the record linkage model of Steorts et al. AISTATS 2014's "SMERED: A Bayesian Approach to Graphical Record Linkage and De-d… ☆27 · Updated 8 years ago
- Stanford Entity-Resolution Framework ☆23 · Updated 6 years ago
- Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics. ☆79 · Updated last year
- DataOps for Government ☆34 · Updated 6 years ago
- Machine assisted dossiers ☆19 · Updated 7 years ago
- data wrangling simplicity, complete audit transparency, and at speed ☆34 · Updated last week
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/) ☆11 · Updated 10 years ago
- See https://github.com/tworavens/tworavens for current repository for this project and http://2ra.vn for project pages. ☆30 · Updated 6 years ago
- Carles Pina Estany's 2020 Tool Fund: data managers and researchers collaborate to write the Frictionless Data packages, tabular schemas, … ☆16 · Updated 2 years ago
- Demonstration of how dedupe might be used as geocoder ☆17 · Updated 2 years ago
- Dedupe/batch geocode addresses and venues around the world with libpostal ☆83 · Updated 3 years ago
- Interactive notebooks containing demonstration code of the splink library ☆38 · Updated last year
- pyspark-parallelised functions producing graph-theoretical metrics in connected component clusters for use in record-linkage (or other do… ☆10 · Updated last year
- Perform Bayesian record linkage with a one-to-one matching assumption. ☆11 · Updated 4 years ago