Christopher-Thornton / hmni
π Fuzzy Name Matching with Machine Learning
β264Updated 9 months ago
Alternatives and similar repositories for hmni:
Users that are interested in hmni are comparing it to the libraries listed below
- Fast, flexible name matching for large datasetsβ71Updated last year
- Fuzzy matching and more functionality for spaCy.β256Updated 9 months ago
- Super Fast String Matching in Pythonβ367Updated 3 weeks ago
- Python package to accelerate the sparse matrix multiplication and top-n similarity selectionβ404Updated 2 weeks ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Pythonβ137Updated 8 months ago
- demo using FuzzyWuzzy matching company namesβ74Updated 3 years ago
- Text analysis with networks.β284Updated last week
- Package that returns a company embedding given a company nameβ45Updated 4 years ago
- Company Name Processor written in Pythonβ336Updated 10 months ago
- spaCy pipeline object for negating concepts in textβ277Updated 10 months ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entiβ¦β245Updated last year
- SpikeX - SpaCy Pipes for Knowledge Extractionβ398Updated 3 years ago
- PYthon Automated Term Extractionβ311Updated 2 years ago
- Fuzzy string matching, grouping, and evaluation.β758Updated last month
- Abydos NLP/IR library for Pythonβ185Updated 2 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasksβ922Updated 7 months ago
- β189Updated 10 months ago
- Spacy NER annotator using ipywidgetsβ120Updated last year
- semi supervised guided topic model with custom guidedLDAβ505Updated 4 years ago
- A powerful and modular toolkit for record linkage and duplicate detection in Pythonβ997Updated last year
- A Corpus of 475,000 Industrial Occupationsβ66Updated 4 years ago
- Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learningβ310Updated last month
- Google USE (Universal Sentence Encoder) for spaCyβ183Updated 2 years ago
- A spaCy pipeline and model for NLP on unstructured legal text.β648Updated 8 months ago
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!β472Updated 2 years ago
- Steam review texting embedding analysisβ141Updated 2 years ago
- A Python library for calculating a large variety of metrics from textβ334Updated 3 months ago
- Python package for performing Entity and Text Matching using Deep Learning.β586Updated 9 months ago
- A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text dataβ¦β242Updated 11 months ago
- A Flexible Deep Learning Approach to Fuzzy String Matchingβ144Updated 5 months ago