data61 / anonlink
Python implementation of anonymous linkage using cryptographic linkage keys
☆65 · Updated 11 months ago
Alternatives and similar repositories for anonlink:
Users who are interested in anonlink are comparing it to the libraries listed below.
- CLK hash: hash PII for entity matching ☆47 · Updated last week
- Privacy Preserving Record Linkage Service ☆26 · Updated 2 years ago
- Python implementations of record linkage blocking techniques. ☆20 · Updated last year
- A maximum-strength name parser for record linkage. ☆37 · Updated this week
- A simple command line interface to the datamade/dedupe library. ☆42 · Updated 2 years ago
- Python wrapper for a C++ Double Metaphone ☆15 · Updated 2 years ago
- Convert a CSV to a parquet file. ☆64 · Updated 2 years ago
- Dedupe/batch geocode addresses and venues around the world with libpostal ☆82 · Updated 3 years ago
- Resources for tackling record linkage / deduplication / data matching problems ☆123 · Updated last year
- Demo from Neo4j's Connections: Healthcare & Life Sciences event ☆11 · Updated 4 years ago
- @vega transforms with @ibis-project expressions ☆29 · Updated 4 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/) ☆11 · Updated 10 years ago
- Framework for processing data packages in pipelines of modular components. ☆121 · Updated 3 months ago
- Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics. ☆80 · Updated last year
- A conda-smithy repository for python-duckdb. ☆13 · Updated 3 weeks ago
- Variations of the record linkage model of Steorts et al. AISTATS 2014's "SMERED: A Bayesian Approach to Graphical Record Linkage and De-d… ☆27 · Updated 8 years ago
- JedAI-WebApp is a GUI that facilitates the execution of JedAI. JedAI is an open source, high scalability toolkit that offers out-of-the-b… ☆23 · Updated 2 years ago
- A tool to read CSV files with CSVW metadata and transform them into other formats. ☆32 · Updated 6 years ago
- Python Driver for Apache Drill. ☆59 · Updated 2 years ago
- The SQL/Ibis powered sklearn of record linkage ☆15 · Updated 2 weeks ago
- (Archived) A Python library for record linkage and deduplication. ☆19 · Updated last year
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors. ☆52 · Updated 3 years ago
- SQLAlchemy models and DDL and ERD generation from chop-dbhi/data-models style JSON endpoints. ☆11 · Updated last year
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018). ☆14 · Updated 6 years ago
- Tools for massively parallel and multi-variate data exploration ☆39 · Updated last year
- An open source data analysis platform with features for users with a range of technical skills ☆46 · Updated this week
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies ☆19 · Updated 2 years ago
- The Python API for MonetDB ☆28 · Updated 3 months ago