doukremt / distance
Levenshtein and Hamming distance computation
☆116Updated 5 years ago
Alternatives and similar repositories for distance:
Users that are interested in distance are comparing it to the libraries listed below
- Python search module for fast approximate string matching☆54Updated 2 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆82Updated 9 years ago
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- A disk-based key/value store in Python with no dependencies.☆21Updated 10 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆169Updated 3 years ago
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.☆301Updated 10 months ago
- A Python implementation of the Metaphone and Double Metaphone algorithms☆81Updated last year
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- Locality-sensitive hashing algorithm for text similarity comparisons☆58Updated 3 weeks ago
- Sentiment analysis made easy; built on top off solid libraries.☆24Updated 8 years ago
- Snowball stemming library collection for Python☆121Updated 6 years ago
- A compound word splitter for Python☆48Updated 3 years ago
- 💫 Scripts, tools and resources for developing spaCy☆126Updated 6 years ago
- HAT-Trie for Python☆86Updated 9 years ago
- 💥 Cython hash tables that assume keys are pre-hashed☆87Updated 3 months ago
- SPARK-n-SPELL [WARNING: inactive project, not being updated]☆7Updated 8 years ago
- A simple fuzzy matching set for python strings☆227Updated 8 months ago
- Lightning Fast Language Prediction 🚀☆166Updated 6 years ago
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]☆108Updated 11 years ago
- Aho-Corasick string replacement utility☆24Updated 5 years ago
- A fully customisable language detection pipeline for spaCy☆92Updated 6 years ago
- ☆24Updated 7 years ago
- A Python framework for exploring distributional semantic models.☆85Updated 9 years ago
- Python wrapper for RE2☆296Updated 2 years ago
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine☆186Updated 4 years ago
- Generic Environment for Context-Aware Correction of Orthography☆22Updated 2 years ago
- scikit-learn wrappers for Python fastText.☆232Updated 2 years ago
- Python wrapper for aspell (C extension and python version)☆82Updated last year
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.☆98Updated 4 years ago
- Python Set subclass that supports searching by ngram similarity☆119Updated 3 years ago