J535D165 / recordlinkage-annotator
A browser user interface for manual labeling of record pairs.
☆43Updated last year
Alternatives and similar repositories for recordlinkage-annotator:
Users that are interested in recordlinkage-annotator are comparing it to the libraries listed below
- A maximum-strength name parser for record linkage.☆36Updated 5 months ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).☆14Updated 5 years ago
- Python wrapper for a C++ Double Metaphone☆15Updated 2 years ago
- Scalable String Similarity Joins in Python☆38Updated 6 months ago
- ☆13Updated 5 years ago
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆60Updated this week
- ☆10Updated 4 years ago
- Dataframe Integration with spaCy.☆103Updated 3 years ago
- data wrangling simplicity, complete audit transparency, and at speed☆34Updated 5 months ago
- Python package for deduplication/entity resolution using active learning☆78Updated 5 months ago
- Python package aiding in entity disambiguation based on string and location matching☆18Updated last year
- An End-to-End Evaluation Framework for Entity Resolution Systems☆26Updated last year
- ☆30Updated 2 years ago
- Tools for interactive visual exploration of semantic embeddings.☆29Updated 4 months ago
- Collection of code snippets and utilities for streamlit apps☆22Updated 4 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 4 years ago
- Record Linkage ToolKit (Find and link entities)☆108Updated last year
- Fork of the Freely Extensible Biomedical Record Linkage program☆24Updated 8 years ago
- Interactive notebooks containing demonstration code of the splink library☆37Updated last year
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors.☆52Updated 3 years ago
- Language detection using Spacy and Fasttext☆54Updated last year
- Bag of, not words, but tricks!☆68Updated last year
- ☄️ Parallel and distributed training with spaCy and Ray☆53Updated last year
- ☆54Updated last year
- Python based Wikidata framework for easy dataframe extraction☆41Updated last year
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 3 years ago
- spaCy entry points for Curated Transformers☆26Updated 4 months ago
- quadipy is a python package to help transform structured data into RDF graph format☆18Updated last year
- Public repository for versioning machine learning data☆42Updated 3 years ago