infoscout / weighted-levenshtein
Weighted Levenshtein library
☆106Updated last year
Alternatives and similar repositories for weighted-levenshtein:
Users that are interested in weighted-levenshtein are comparing it to the libraries listed below
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆244Updated 8 months ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆170Updated 3 years ago
- Python search module for fast approximate string matching☆54Updated 2 years ago
- Library for unit extraction - fork of quantulum for python3☆135Updated 7 months ago
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.☆300Updated 7 months ago
- Fast Word Clustering Software☆76Updated 5 months ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.☆75Updated 3 years ago
- A Python 3 phonetics library.☆125Updated 4 years ago
- A simple fuzzy matching set for python strings☆225Updated 5 months ago
- Hidden alignment conditional random field for classifying string pairs.☆36Updated 7 years ago
- ☆167Updated 7 months ago
- Levenshtein and Hamming distance computation☆117Updated 5 years ago
- Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents☆283Updated last year
- Python bindings for libwapiti☆66Updated 5 years ago
- Character-based word embeddings model based on RNN for handling real world texts☆173Updated last year
- Parse natural language time expressions in python☆131Updated 2 years ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆149Updated last year
- Lightning Fast Language Prediction 🚀☆165Updated 5 years ago
- A disk-based key/value store in Python with no dependencies.☆21Updated 9 years ago
- Generic Environment for Context-Aware Correction of Orthography☆22Updated 2 years ago
- Python wrapper for aspell (C extension and python version)☆81Updated last year
- Python Set subclass that supports searching by ngram similarity☆119Updated 3 years ago
- Fast multi-keyword search engine for text strings☆250Updated 4 months ago
- A pure Python Levenshtein implementation that's not freaking GPL'd.☆96Updated last year
- Misspelling Oblivious Word Embeddings☆203Updated 5 years ago
- a pure-Python PATRICIA trie implementation.☆30Updated 10 years ago
- An index data structure for approximate string search.☆23Updated 5 years ago
- Hidden alignment conditional random field for classifying string pairs.☆24Updated 4 months ago
- Language detection extension for spaCy 2.0+☆112Updated 5 years ago
- Python bindings to the Compact Language Detector☆33Updated 4 years ago