J535D165 / recordlinkage-annotator
A browser user interface for manual labeling of record pairs.
☆41Updated last year
Related projects ⓘ
Alternatives and complementary repositories for recordlinkage-annotator
- A maximum-strength name parser for record linkage.☆32Updated 3 months ago
- A simple command line interface to the datamade/dedupe library.☆42Updated last year
- Python wrapper for a C++ Double Metaphone☆15Updated last year
- Scalable String Similarity Joins in Python☆39Updated 3 months ago
- Fast, flexible name matching for large datasets☆70Updated 10 months ago
- ☆13Updated 5 years ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).☆14Updated 5 years ago
- List of entity resolution software and resources.☆35Updated 8 months ago
- Interactive notebooks containing demonstration code of the splink library☆38Updated 9 months ago
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆60Updated this week
- Dataframe Integration with spaCy.☆101Updated 3 years ago
- Hidden alignment conditional random field for classifying string pairs.☆25Updated last month
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 2 years ago
- Generate reports for spaCy models.☆28Updated 2 years ago
- Annotation Management for Prodigy, that support multiple users working in many projects☆15Updated 5 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 3 years ago
- Tools for interactive visual exploration of semantic embeddings.☆28Updated 2 months ago
- data wrangling simplicity, complete audit transparency, and at speed☆34Updated 2 months ago
- Language detection using Spacy and Fasttext☆54Updated 10 months ago
- Python implementation of anonymous linkage using cryptographic linkage keys☆63Updated 5 months ago
- An End-to-End Evaluation Framework for Entity Resolution Systems☆25Updated 11 months ago
- A tutorial on entity resolution (record linkage or de-duplication)☆61Updated 4 years ago
- Probabilistic Entity Matching in Python☆13Updated 7 years ago
- ☆10Updated 4 years ago
- ☆29Updated 2 years ago
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets.☆53Updated last month
- Notebooks configured to be run with Binder, usually found on my blog.☆41Updated last year
- Python package for deduplication/entity resolution using active learning☆79Updated 2 months ago