data61 / blocklib
Python implementations of record linkage blocking techniques.
☆21, updated last year
Alternatives and similar repositories for blocklib
Users who are interested in blocklib are comparing it to the libraries listed below.
- CLK hash: hash PII for entity matching ☆47, updated 4 months ago
- A maximum-strength name parser for record linkage. ☆38, updated 2 weeks ago
- Python implementation of anonymous linkage using cryptographic linkage keys ☆65, updated last year
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆62, updated this week
- Python wrapper for a C++ Double Metaphone ☆15, updated 3 weeks ago
- data wrangling simplicity, complete audit transparency, and at speed ☆34, updated last week
- ☆48, updated last year
- A tool to read CSV files with CSVW metadata and transform them into other formats. ☆33, updated 6 years ago
- Scalable String Similarity Joins in Python ☆39, updated last year
- Record matching and entity resolution at scale in Spark ☆35, updated last year
- Language detection using spaCy and fastText ☆57, updated last year
- A browser user interface for manual labeling of record pairs. ☆47, updated 2 years ago
- PyPI module for Graphlet AI Knowledge Graph Factory ☆29, updated 2 years ago
- A simple command line interface to the datamade/dedupe library. ☆42, updated 2 years ago
- Record Linkage ToolKit (Find and link entities) ☆110, updated 2 years ago
- Set-oriented Operations in Pandas ☆24, updated 5 years ago
- This project is created to promote and advocate the use of FOSS machine learning. ☆46, updated 4 months ago
- Resources for tackling record linkage / deduplication / data matching problems ☆125, updated last year
- Now included in rigour ☆151, updated 2 weeks ago
- Copy Pandas DataFrames and HDF5 files to PostgreSQL database ☆55, updated 8 months ago
- ICIJ #FinCEN Files in Neo4j ☆37, updated 4 years ago
- Application and Python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets… ☆46, updated 4 years ago
- Comparing Polars to Pandas and a small introduction ☆44, updated 4 years ago
- Extract information from XBRL files in the ESEF format ☆12, updated this week
- Modeling tool like DBT to use SQL Alchemy core with a DataFrame interface like ☆11, updated 2 years ago
- Quickly compare changes made to Jupyter notebooks in GitHub repositories with jupydiff! ☆13, updated 2 years ago
- Repo demonstrating a Dagster pipeline to generate Neo4j Graph ☆22, updated 4 years ago
- A small Python module containing quick utility functions for standard ETL processes. ☆36, updated 3 weeks ago
- Loading OpenSanctions into Neo4j and Linkurious ☆30, updated 9 months ago
- Interactive notebooks containing demonstration code of the splink library ☆39, updated last year