OlivierBinette / StringCompare
Efficient String Comparison Functions and Fuzzy String Matching
☆17Updated 2 years ago
Alternatives and similar repositories for StringCompare:
Users that are interested in StringCompare are comparing it to the libraries listed below
- An End-to-End Evaluation Framework for Entity Resolution Systems☆26Updated last year
- A Flexible Deep Learning Approach to Fuzzy String Matching☆139Updated 3 months ago
- A tutorial on entity resolution (record linkage or de-duplication)☆62Updated 4 years ago
- Entity resolution using zero labeled examples☆28Updated 6 months ago
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆46Updated 6 years ago
- Perform Bayesian record linkage with a one-to-one matching assumption.☆11Updated 4 years ago
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 3 years ago
- ☆32Updated 3 years ago
- Fast, flexible name matching for large datasets☆70Updated last year
- ☆15Updated 2 years ago
- pyspark-parallelised functions producing graph-theoretical metrics in connected component clusters for use in record-linkage (or other do …☆10Updated last year
- A Python package for efficient evaluation based on OASIS (Optimal Asymptotic Sequential Importance Sampling).☆15Updated 3 years ago
- A very simple library for exploiting graph-of-words in NLP☆12Updated 3 years ago
- ☆9Updated 3 years ago
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆75Updated 2 months ago
- Notebooks configured to be run with Binder, usually found on my blog.☆42Updated last year
- A browser user interface for manual labeling of record pairs.☆42Updated last year
- Similarity and distance measures for clustering and record linkage applications in R☆17Updated 2 years ago
- Train, evaluate, and use different unsupervised topic modelling algorithms using a RESTful API.☆36Updated last year
- ☆54Updated last year
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Updated 6 years ago
- ☆26Updated last week
- Minoan ER is an Entity Resolution (ER) framework, built by researchers in Crete (the land of the ancient Minoan civilization). Entity res…☆17Updated 4 years ago
- ☆31Updated this week
- Pre-print:☆11Updated last year
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆12Updated 5 months ago
- Process to rebuild the European Court of Human Rights database and datasets from scratch☆21Updated 7 months ago
- A rolling version of the Latent Dirichlet Allocation.☆12Updated last year
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆137Updated 6 months ago
- A maximum-strength name parser for record linkage.☆36Updated 5 months ago