richmilne / JaroWinkler
Original, standard and customisable versions of the Jaro-Winkler functions.
★31 · Updated 2 years ago
Alternatives and similar repositories for JaroWinkler:
Users who are interested in JaroWinkler are comparing it to the libraries listed below.
- Multi-Language Identification ★28 · Updated 9 months ago
- Language detection using Spacy and Fasttext ★55 · Updated last year
- 💥 Cython hash tables that assume keys are pre-hashed ★87 · Updated 3 months ago
- ★70 · Updated 2 years ago
- Scalable String Similarity Joins in Python ★39 · Updated 9 months ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe… ★67 · Updated 2 years ago
- Ensemble topic modeling with matrix factorization ★25 · Updated 6 years ago
- ★30 · Updated 2 years ago
- Graph extraction and NLP analysis for Baleen Corpora ★18 · Updated 8 years ago
- This is an Object Oriented implementation of a Trie in python. The class contains setter and getter methods, and implements several usefu… ★14 · Updated 7 years ago
- A Python implementation of the Metaphone and Double Metaphone algorithms ★81 · Updated last year
- Python wrapper for aspell (C extension and python version) ★82 · Updated last year
- An index data structure for approximate string search. ★23 · Updated 5 years ago
- Text readability metrics in Python. ★11 · Updated 11 years ago
- An alternative approach for probabilistic topic modeling based on agglomerative clustering of topics (not documents) ★12 · Updated 4 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?) ★36 · Updated 10 years ago
- A disk-based key/value store in Python with no dependencies. ★21 · Updated 9 years ago
- Language detection extension for spaCy 2.0+ ★112 · Updated 6 years ago
- Finds linguistic patterns effortlessly ★36 · Updated last year
- Find which links on a web page are pagination links ★29 · Updated 8 years ago
- spaCy match and replace, maintaining conjugation ★35 · Updated 2 years ago
- A pure Python Levenshtein implementation that's not freaking GPL'd. ★97 · Updated 2 years ago
- Force-Atlas 2 graph layout in networkx ★22 · Updated 10 years ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want) ★66 · Updated 2 years ago
- sequence tagging with spaCy and crfsuite ★19 · Updated 2 years ago
- Cython wrapper on Hunspell Dictionary ★23 · Updated last year
- A basic python wrapper for Natty natural language date parser ★17 · Updated 10 years ago
- IPython Magic for exporting pandas objects to Excel ★13 · Updated 7 years ago
- Python wrapper for Apache OpenNLP tools ★34 · Updated 8 years ago
- A fully customisable language detection pipeline for spaCy ★92 · Updated 5 years ago