data61 / blocklibLinks
Python implementations of record linkage blocking techniques.
☆21Updated 2 years ago
Alternatives and similar repositories for blocklib
Users that are interested in blocklib are comparing it to the libraries listed below
Sorting:
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆66Updated 2 weeks ago
- Python implementation of anonymous linkage using cryptographic linkage keys☆70Updated last year
- Scalable String Similarity Joins in Python☆39Updated last year
- Record Linkage ToolKit (Find and link entities)☆111Updated 2 years ago
- Record matching and entity resolution at scale in Spark☆36Updated 2 years ago
- A maximum-strength name parser for record linkage.☆39Updated 4 months ago
- CLK hash: hash pii for entity matching☆47Updated 8 months ago
- An automation tool to refactor Jupyter Notebooks to Python modules, with code dependency analysis.☆12Updated 11 months ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆25Updated 3 years ago
- real-time data + ML pipeline☆53Updated last week
- Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…☆46Updated 3 weeks ago
- Python wrapper for a C++ Double Metaphone☆15Updated 2 weeks ago
- Language detection using Spacy and Fasttext☆57Updated 2 years ago
- MLOps simplified. One-stop AI delivery platform, all the features you need.☆106Updated this week
- Now included in rigour☆152Updated 2 months ago
- Angular JS Solr and Elasticsearch and OpenSearch Diagnostic Search Services☆28Updated 3 weeks ago
- A browser user interface for manual labeling of record pairs.☆48Updated 2 years ago
- Resources for tackling record linkage / deduplication / data matching problems☆126Updated last year
- Generating Realistic Synthetic Data☆41Updated last year
- Python library for MIME type parsing, normalisation and grouping.☆13Updated last year
- A tool to read CSV files with CSVW metadata and transform them into other formats.☆33Updated 6 years ago
- ☆21Updated 4 years ago
- CD4AutoML: Continuous Delivery for AutoML with Amazon SageMaker Autopilot and Amazon Step Functions☆13Updated 5 years ago
- Spark NLP for Streamlit☆15Updated 4 years ago
- PyPi module for Graphlet AI Knowledge Graph Factory☆33Updated 2 years ago
- Loading OpenSanctions into Neo4J and Linkurious☆31Updated last year
- ☆17Updated last year
- High-performance data retrieval from Neo4j with Apache Arrow 🏹☆32Updated 3 years ago
- Code accompanying AWS blog post "Build a Semantic Search Engine for Tabular Columns with Transformers and Amazon OpenSearch Service"☆18Updated 2 years ago
- This project is created to promote and advocate the use of FOSS machine learning.☆47Updated this week