data61 / anonlink-entity-service
Privacy Preserving Record Linkage Service
☆26Updated last year
Related projects ⓘ
Alternatives and complementary repositories for anonlink-entity-service
- CLK hash: hash pii for entity matching☆47Updated last year
- Python implementation of anonymous linkage using cryptographic linkage keys☆63Updated 6 months ago
- Python implementations of record linkage blocking techniques.☆19Updated last year
- Python wrapper for a C++ Double Metaphone☆15Updated last year
- A maximum-strength name parser for record linkage.☆34Updated 3 months ago
- ☆13Updated 5 years ago
- Copy Pandas DataFrames and HDF5 files to PostgreSQL database☆54Updated 3 months ago
- A simple command line interface to the datamade/dedupe library.☆42Updated last year
- Traptor -- A distributed Twitter feed☆26Updated 2 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 9 years ago
- data wrangling simplicity, complete audit transparency, and at speed☆35Updated 2 months ago
- A small Python module containing quick utility functions for standard ETL processes.☆33Updated last week
- Set-oriented Operations in Pandas☆24Updated 4 years ago
- Pipeline Explorer - Explore and analyze millions of pipelines learned using MLBlocks and MLPrimitives.☆17Updated last year
- A Singer.io Target for the Stitch Import API☆26Updated this week
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).☆14Updated 5 years ago
- Enhance your feature engineering workflow with Kodiak☆20Updated last year
- A tool to read CSV files with CSVW metadata and transform them into other formats.☆32Updated 5 years ago
- Stanford Entity-Resolution Framework☆23Updated 6 years ago
- Python classes for data manipulation☆25Updated last month
- Privacy preserving synthetic data generation workflows☆20Updated 2 years ago
- Resources for tackling record linkage / deduplication / data matching problems☆112Updated 9 months ago
- Fork of the Freely Extensible Biomedical Record Linkage program☆24Updated 8 years ago
- An engine for fast time series data aggregation☆12Updated 5 years ago
- ☆10Updated 4 years ago
- NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (Au…☆42Updated 3 years ago
- Dexter document monitor for MMA☆17Updated 6 months ago
- Sklearn transformers that work with Pandas dataframes☆11Updated 4 years ago
- Scalable String Similarity Joins in Python☆39Updated 4 months ago
- Quickly compare changes made to Jupyter notebooks in GitHub repositories with jupydiff!☆13Updated last year