data61 / anonlink
Python implementation of anonymous linkage using cryptographic linkage keys
☆65Updated 10 months ago
Alternatives and similar repositories for anonlink:
Users that are interested in anonlink are comparing it to the libraries listed below
- CLK hash: hash pii for entity matching☆47Updated last week
- Privacy Preserving Record Linkage Service☆26Updated 2 years ago
- Python implementations of record linkage blocking techniques.☆20Updated last year
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- A maximum-strength name parser for record linkage.☆36Updated last month
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 3 years ago
- Framework for processing data packages in pipelines of modular components.☆121Updated last month
- Resources for tackling record linkage / deduplication / data matching problems☆122Updated last year
- Python wrapper for a C++ Double Metaphone☆15Updated 2 years ago
- Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics.☆79Updated last year
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 10 years ago
- Dedupe/batch geocode addresses and venues around the world with libpostal☆83Updated 3 years ago
- Stanford Entity-Resolution Framework☆23Updated 6 years ago
- A browser user interface for manual labeling of record pairs.☆45Updated last year
- See https://github.com/tworavens/tworavens for current repository for this project and http://2ra.vn for project pages.☆30Updated 6 years ago
- Carles Pina Estany's 2020 Tool Fund: data managers and researchers collaborate to write the Frictionless Data packages, tabular schemas, …☆16Updated 2 years ago
- Convert a CSV to a parquet file.☆64Updated 2 years ago
- data wrangling simplicity, complete audit transparency, and at speed☆34Updated last week
- Demonstration of how dedupe might be used as geocoder☆17Updated 2 years ago
- Parser for U.S. federal regulations and other regulatory information☆39Updated last year
- LSHDB is a parallel and distributed data engine, which relies on Locality-Sensitive Hashing and noSQL systems, for performing record link…☆31Updated 2 years ago
- SQLAlchemy models and DDL and ERD generation from chop-dbhi/data-models style JSON endpoints.☆11Updated last year
- ☆15Updated this week
- Embedded MonetDB with a Python frontend and fast Numpy/Pandas support☆62Updated 5 months ago
- Algorithms for "schema matching"☆26Updated 8 years ago
- Data validation as a service. Project retired, got to the current one at frictionsless/repository☆69Updated 2 years ago
- @vega transforms with @ibis-project expressions☆29Updated 3 years ago
- variations of the record linkage model of Steorts et al. AISTATS 2014's "SMERED: A Bayesian Approach to Graphical Record Linkage and De-d…☆27Updated 8 years ago
- A Python library for working with Data Packages.☆192Updated last year
- Data cleaning and validation functions for names, languages, identifiers, etc.☆19Updated this week