data61 / blocklib
Python implementations of record linkage blocking techniques.
☆20 · Updated last year
Alternatives and similar repositories for blocklib:
Users who are interested in blocklib are comparing it to the libraries listed below.
- CLK hash: hash pii for entity matching ☆47 · Updated 3 weeks ago
- Python wrapper for a C++ Double Metaphone ☆15 · Updated 2 years ago
- Privacy Preserving Record Linkage Service ☆26 · Updated 2 years ago
- Python implementation of anonymous linkage using cryptographic linkage keys ☆65 · Updated 10 months ago
- ☆13 · Updated 5 years ago
- A maximum-strength name parser for record linkage. ☆36 · Updated 2 weeks ago
- A tool to read CSV files with CSVW metadata and transform them into other formats. ☆32 · Updated 5 years ago
- Collaboration app for sharing and reviewing jupyter notebooks ☆16 · Updated last year
- Abstractions for feature engineering on large graphs of tabular data. ☆21 · Updated this week
- Graphistry admin docs: launch, configure, use, & debug ☆26 · Updated 3 weeks ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I… ☆22 · Updated 2 years ago
- Record matching and entity resolution at scale in Spark ☆34 · Updated last year
- This project is a wrapper for Leilex, a legal entity identifier API. Includes ISIN-LEI conversion. Search for an LEI number using a company name. ☆24 · Updated 6 months ago
- A Scalable Data Cleaning Library for PySpark. ☆27 · Updated 6 years ago
- Render Jupyter Notebooks With Metaflow Cards ☆29 · Updated 6 months ago
- Plugin for Intake to read from SQL servers ☆15 · Updated last year
- stemgraphic python package for visualization of data and text ☆18 · Updated 4 years ago
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight. ☆11 · Updated 4 years ago
- Streaming web crawler with WebSocket API ☆44 · Updated last year
- A labextension to integrate pyflyby with notebooks ☆12 · Updated 3 months ago
- Advanced data wrangling for python ☆12 · Updated last year
- A Python package that simplifies the use of secrets in a Jupyter notebook ☆21 · Updated 3 years ago
- Datasette plugin providing instructions for exporting data to Jupyter or Observable ☆12 · Updated last year
- Burglary prediction for mortals ☆10 · Updated 10 months ago
- Utilities for creating ETL pipelines with mara ☆36 · Updated 2 years ago
- Cookiecutter for community-maintained Jupyter Docker images ☆15 · Updated last week
- Statistical visualizations for Datasette using Seaborn ☆12 · Updated 3 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆57 · Updated 3 months ago
- API client for Aleph, supports bulk entity and document upload. ☆28 · Updated 6 months ago
- A markdown wiki and dashboarding system for Datasette ☆21 · Updated 3 years ago