data61 / blocklib
Python implementations of record linkage blocking techniques.
☆19Updated last year
Alternatives and similar repositories for blocklib:
Users that are interested in blocklib are comparing it to the libraries listed below
- CLK hash: hash pii for entity matching☆47Updated this week
- Python implementation of anonymous linkage using cryptographic linkage keys☆65Updated 9 months ago
- Privacy Preserving Record Linkage Service☆26Updated last year
- Python wrapper for a C++ Double Metaphone☆15Updated 2 years ago
- ☆13Updated 5 years ago
- A maximum-strength name parser for record linkage.☆36Updated last week
- ☆15Updated 2 years ago
- Render Jupyter Notebooks With Metaflow Cards☆25Updated 4 months ago
- A Scalable Data Cleaning Library for PySpark.☆26Updated 5 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆55Updated 2 months ago
- API client for Aleph, supports bulk entity and document upload.☆28Updated 4 months ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆30Updated 2 years ago
- Record matching and entity resolution at scale in Spark☆34Updated last year
- Burglary prediction for mortals☆10Updated 8 months ago
- Collaboration app for sharing and reviewing jupyter notebooks☆16Updated last year
- A few end to end examples that use data-describe☆16Updated last year
- Plugin for Intake to read from SQL servers☆15Updated last year
- A financial disclosure data extraction tool.☆13Updated last year
- This repository contains code to build an MVP search engine with google like interface.☆15Updated 4 years ago
- A Pythonic API for Amazon's States Language for defining AWS Step Functions☆8Updated 2 years ago
- An open source data analysis platform with features for users with a range of technical skills☆46Updated this week
- Uses your app logs to visualize how the data moves between the code, database, HTTP services, message queue, external storages etc.☆23Updated 10 months ago
- Abstractions for feature engineering on large graphs of tabular data.☆21Updated 3 weeks ago
- This project is wraper for Leilex, legal entity identifier API. Includes ISIN-LEI conversion. Search LEI number using company name.☆23Updated 4 months ago
- My dot files in one place - extensively edited over time. Your mileage may vary☆2Updated 8 years ago
- A Flask webapp that categorizes Outlook emails using machine learning☆15Updated 9 years ago
- 📕 Writing tests, the DataMade way☆16Updated 4 years ago
- JedAI-WebApp is a GUI that facilitates the execution of JedAI. JedAI is an open source, high scalability toolkit that offers out-of-the-b…☆23Updated last year
- Techniques for Scraping the Web in Python☆26Updated 6 years ago
- ☆15Updated 6 years ago