data61 / blocklibLinks
Python implementations of record linkage blocking techniques.
☆21Updated 2 years ago
Alternatives and similar repositories for blocklib
Users that are interested in blocklib are comparing it to the libraries listed below
Sorting:
- CLK hash: hash pii for entity matching☆47Updated 7 months ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆63Updated last week
- Python implementation of anonymous linkage using cryptographic linkage keys☆70Updated last year
- A maximum-strength name parser for record linkage.☆39Updated 3 months ago
- ☆48Updated last year
- Python wrapper for a C++ Double Metaphone☆15Updated 3 weeks ago
- Record matching and entity resolution at scale in Spark☆36Updated 2 years ago
- A tool to read CSV files with CSVW metadata and transform them into other formats.☆33Updated 6 years ago
- Scalable String Similarity Joins in Python☆39Updated last year
- PyPi module for Graphlet AI Knowledge Graph Factory☆33Updated 2 years ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆25Updated 3 years ago
- High-performance data retrieval from Neo4j with Apache Arrow 🏹☆32Updated 3 years ago
- Trying to generate name synonyms from wikidata☆34Updated 5 years ago
- Resources for tackling record linkage / deduplication / data matching problems☆125Updated last year
- Loading OpenSanctions into Neo4J and Linkurious☆31Updated 11 months ago
- A browser user interface for manual labeling of record pairs.☆48Updated 2 years ago
- React UI component library for aleph/followthemoney☆12Updated 3 years ago
- Now included in rigour☆152Updated 3 weeks ago
- Interactive notebooks containing demonstration code of the splink library☆40Updated last year
- Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…☆46Updated last week
- data wrangling simplicity, complete audit transparency, and at speed☆35Updated 2 months ago
- Package that returns a company embedding given a company name☆47Updated 5 years ago
- Talk "Beyond pandas: The great Python dataframe showdown"☆37Updated 3 years ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 4 years ago
- Fuzzy Categorical Distances☆14Updated 5 years ago
- ICIJ #Fincen Files in Neo4j☆40Updated 5 years ago
- Record Linkage ToolKit (Find and link entities)☆111Updated 2 years ago
- Python based Wikidata framework for easy dataframe extraction☆45Updated 2 years ago
- PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolut…☆159Updated 3 years ago
- Docker template for basic data science packages to interface with Neo4j☆14Updated 4 years ago