data61 / blocklibLinks
Python implementations of record linkage blocking techniques.
☆21Updated 2 years ago
Alternatives and similar repositories for blocklib
Users that are interested in blocklib are comparing it to the libraries listed below
Sorting:
- CLK hash: hash pii for entity matching☆47Updated 5 months ago
- Python implementation of anonymous linkage using cryptographic linkage keys☆67Updated last year
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆63Updated last week
- Python wrapper for a C++ Double Metaphone☆15Updated this week
- Record matching and entity resolution at scale in Spark☆35Updated last year
- Scalable String Similarity Joins in Python☆39Updated last year
- A maximum-strength name parser for record linkage.☆38Updated last month
- Privacy Preserving Record Linkage Service☆26Updated 2 years ago
- PyPi module for Graphlet AI Knowledge Graph Factory☆30Updated 2 years ago
- Set-oriented Operations in Pandas☆24Updated 5 years ago
- A browser user interface for manual labeling of record pairs.☆47Updated 2 years ago
- Resources for tackling record linkage / deduplication / data matching problems☆125Updated last year
- Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…☆46Updated 2 weeks ago
- Trying to generate name synonyms from wikidata☆34Updated 5 years ago
- Record Linkage ToolKit (Find and link entities)☆109Updated 2 years ago
- A selection of business datasets☆18Updated 6 years ago
- A small Python module containing quick utility functions for standard ETL processes.☆36Updated last month
- ☆48Updated last year
- ☆17Updated 7 years ago
- Python library for MIME type parsing, normalisation and grouping.☆13Updated 11 months ago
- Scraping Assisted by Learning☆35Updated 3 weeks ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- Techniques & resources for training interpretable ML models, explaining ML models, and debugging ML models.☆21Updated 3 years ago
- This project provides an example of consolidating Milvus (vector search engine) and PostgreSQL (relational database) to carry out the hyb…☆11Updated 4 years ago
- Advanced similarity and duplicate source code at scale.☆56Updated 6 years ago
- PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolut…☆156Updated 2 years ago
- Talk "Beyond pandas: The great Python dataframe showdown"☆37Updated 3 years ago
- A tool to read CSV files with CSVW metadata and transform them into other formats.☆33Updated 6 years ago
- Extract information from XBRL files in the ESEF format☆12Updated this week
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets.☆80Updated 7 months ago