lanl / pyxDamerauLevenshtein
pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.
☆246Updated 10 months ago
Alternatives and similar repositories for pyxDamerauLevenshtein:
Users that are interested in pyxDamerauLevenshtein are comparing it to the libraries listed below
- Fast multi-keyword search engine for text strings☆252Updated 6 months ago
- A simple fuzzy matching set for python strings☆225Updated 7 months ago
- An efficient simhash implementation for python☆124Updated 5 years ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆386Updated 2 years ago
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine☆186Updated 4 years ago
- Python Set subclass that supports searching by ngram similarity☆119Updated 3 years ago
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.☆300Updated 9 months ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆169Updated 3 years ago
- Text normalization library for Python☆204Updated 7 years ago
- spellchecking library for python☆608Updated 9 months ago
- Python stemming library using snowball stemmers☆250Updated 5 months ago
- Weighted Levenshtein library☆106Updated last year
- ☆130Updated 3 years ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆136Updated 8 months ago
- Extract countries, regions and cities from a URL or text☆218Updated 4 years ago
- Fast implementation of the edit distance(Levenshtein distance)☆681Updated last year
- Levenshtein and Hamming distance computation☆116Updated 5 years ago
- Iterative JSON parser with Pythonic interface☆620Updated 5 years ago
- scikit-learn inspired API for CRFsuite☆427Updated last year
- Get list of common stop words in various languages in Python☆155Updated last year
- Textpipe: clean and extract metadata from text☆302Updated 3 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆374Updated 2 years ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆1,274Updated 3 years ago
- Python binding to libpoppler with focus on text extraction☆97Updated 3 years ago
- Thin wrapper for the Microsoft Cognitive Services☆60Updated 7 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆181Updated last year
- Yet another Python binding for fastText☆226Updated 6 years ago
- SciKit-Learn Laboratory (SKLL) makes it easy to run machine learning experiments.☆553Updated 3 months ago
- 💫 Scripts, tools and resources for developing spaCy☆125Updated 6 years ago