kvh / match
Probabilistic Entity Matching in Python
☆13Updated 7 years ago
Alternatives and similar repositories for match:
Users that are interested in match are comparing it to the libraries listed below
- A browser user interface for manual labeling of record pairs.☆44Updated last year
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 4 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- A maximum-strength name parser for record linkage.☆36Updated last week
- Resources for tackling record linkage / deduplication / data matching problems☆117Updated 11 months ago
- A Scalable Data Cleaning Library for PySpark.☆26Updated 5 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆15Updated 6 years ago
- Record Linkage ToolKit (Find and link entities)☆108Updated last year
- ☆10Updated 4 years ago
- This project is wraper for Leilex, legal entity identifier API. Includes ISIN-LEI conversion. Search LEI number using company name.☆23Updated 4 months ago
- Scalable String Similarity Joins in Python☆38Updated 7 months ago
- Package that returns a company embedding given a company name☆44Updated 4 years ago
- Python package aiding in entity disambiguation based on string and location matching☆18Updated last year
- How to do data science with Optimus, Spark and Python.☆19Updated 5 years ago
- Python script for matching a list of messy addresses against a gazetteer using dedupe.☆62Updated 4 years ago
- Dataframe Integration with spaCy.☆103Updated 3 years ago
- Interactive notebooks containing demonstration code of the splink library☆37Updated last year
- Python wrapper for a C++ Double Metaphone☆15Updated 2 years ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆34Updated 4 years ago
- Predict age and gender from a first name☆60Updated 6 years ago
- ☆17Updated 4 years ago
- ☆15Updated 2 years ago
- Topic modelling on financial news with Natural Language Processing☆58Updated 7 years ago
- Notebooks configured to be run with Binder, usually found on my blog.☆42Updated last year
- Public repository for versioning machine learning data☆42Updated 3 years ago
- Tutorial code and data for the entity resolution workshops.☆43Updated 9 years ago
- Trying to generate name synonyms from wikidata☆32Updated 4 years ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).☆14Updated 6 years ago
- Record matching and entity resolution at scale in Spark☆34Updated last year
- ☆15Updated 6 years ago