doukremt / distance
Levenshtein and Hamming distance computation
☆117Updated 4 years ago
Related projects: ⓘ
- Language detection extension for spaCy 2.0+☆111Updated 5 years ago
- Python search module for fast approximate string matching☆53Updated last year
- A Python implementation of the Metaphone and Double Metaphone algorithms☆80Updated 6 months ago
- A simple fuzzy matching set for python strings☆222Updated last month
- Snowball stemming library collection for Python☆123Updated 5 years ago
- Python bindings for the Google's FarmHash☆37Updated 3 weeks ago
- Python wrapper for aspell (C extension and python version)☆81Updated last year
- 💥 Cython hash tables that assume keys are pre-hashed☆82Updated 10 months ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆242Updated 4 months ago
- Python bindings to the Compact Language Detector☆32Updated 4 years ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆380Updated 2 years ago
- 💫 Scripts, tools and resources for developing spaCy☆125Updated 5 years ago
- Hunspell extension for spaCy 2.0.☆94Updated last month
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]☆109Updated 11 years ago
- A disk-based key/value store in Python with no dependencies.☆21Updated 9 years ago
- A pure Python Levenshtein implementation that's not freaking GPL'd.☆97Updated last year
- Text normalization library for Python☆202Updated 6 years ago
- Python Set subclass that supports searching by ngram similarity☆119Updated 3 years ago
- ☆46Updated this week
- Lightning Fast Language Prediction 🚀☆163Updated 5 years ago
- ☆50Updated last year
- Locality-sensitive hashing algorithm for text similarity comparisons☆58Updated 2 years ago
- A Python library for extracting semantic information from text, such as dates and numbers.☆74Updated 2 years ago
- A fully customisable language detection pipeline for spaCy☆93Updated 5 years ago
- Textpipe: clean and extract metadata from text☆300Updated 3 years ago
- Time everything in IPython☆118Updated 10 months ago
- C++ Ternary Search Tree implementation with Python bindings☆43Updated 6 years ago
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.☆299Updated 3 months ago
- Fast supervised sentence boundary detection using the averaged perceptron☆90Updated 5 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆144Updated 8 months ago