Christopher-Thornton / hmniLinks
π Fuzzy Name Matching with Machine Learning
β264Updated last year
Alternatives and similar repositories for hmni
Users that are interested in hmni are comparing it to the libraries listed below
Sorting:
- Fuzzy matching and more functionality for spaCy.β256Updated last year
- Super Fast String Matching in Pythonβ369Updated 4 months ago
- Fuzzy string matching, grouping, and evaluation.β773Updated 3 weeks ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing powerβ190Updated 2 years ago
- SpikeX - SpaCy Pipes for Knowledge Extractionβ399Updated 4 years ago
- Python package to accelerate the sparse matrix multiplication and top-n similarity selectionβ411Updated this week
- PYthon Automated Term Extractionβ315Updated 2 years ago
- A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text dataβ¦β242Updated last year
- Dataframe Integration with spaCy.β103Updated 4 years ago
- Fixes contractions such as `you're` to `you are`β318Updated 2 years ago
- βοΈ Parallel and distributed training with spaCy and Rayβ56Updated 2 years ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4β283Updated 2 years ago
- Google USE (Universal Sentence Encoder) for spaCyβ184Updated 2 years ago
- Package that returns a company embedding given a company nameβ46Updated 5 years ago
- Spacy NER annotator using ipywidgetsβ123Updated last year
- β192Updated last year
- Fast, flexible name matching for large datasetsβ72Updated 2 months ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Pythonβ140Updated last year
- 𧬠A JupyterLab extension for annotating data with Prodigyβ189Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entiβ¦β245Updated 2 years ago
- Text analysis with networks.β287Updated 3 months ago
- Company Name Processor written in Pythonβ341Updated last year
- spaCy pipeline object for negating concepts in textβ281Updated last month
- Information extraction from English and German texts based on predicate logicβ391Updated 3 years ago
- Textpipe: clean and extract metadata from textβ302Updated 4 years ago
- Information extraction from English and German texts based on predicate logicβ138Updated 2 years ago
- Simplifies use of the Dedupe library via Pandasβ136Updated 2 years ago
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasksβ159Updated 2 years ago
- demo using FuzzyWuzzy matching company namesβ75Updated 3 years ago
- π³ Recipes for the Prodigy, our fully scriptable annotation toolβ496Updated 11 months ago