infoscout / weighted-levenshtein
Weighted Levenshtein library
☆105 Updated last year
Related projects
Alternatives and complementary repositories for weighted-levenshtein
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance. ☆243 Updated 6 months ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe… ☆170 Updated 2 years ago
- Hunspell extension for spaCy 2.0. ☆94 Updated 3 months ago
- A Python 3 phonetics library. ☆124 Updated 4 years ago
- Language independent truecaser in Python. ☆161 Updated 3 years ago
- Hidden alignment conditional random field for classifying string pairs. ☆37 Updated 7 years ago
- Named Entity Recognition data for Europeana Newspapers ☆173 Updated last year
- Python bindings to the Compact Language Detector ☆33 Updated 4 years ago
- A Python implementation of the Metaphone and Double Metaphone algorithms ☆80 Updated 8 months ago
- Python search module for fast approximate string matching ☆53 Updated last year
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance. ☆75 Updated 2 years ago
- Fast multi-keyword search engine for text strings ☆247 Updated 2 months ago
- An unsupervised compound splitter ☆40 Updated 5 years ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python ☆137 Updated 4 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing ☆67 Updated 2 weeks ago
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆149 Updated last year
- ☆165 Updated 5 months ago
- Language detection extension for spaCy 2.0+ ☆111 Updated 5 years ago
- Character-based word embeddings model based on RNN for handling real world texts ☆172 Updated last year
- Abydos NLP/IR library for Python ☆183 Updated 2 years ago
- Making sense embedding out of word embeddings using graph-based word sense induction ☆212 Updated 3 years ago
- A simple fuzzy matching set for Python strings ☆223 Updated 3 months ago
- Compound splitter for German ☆103 Updated 4 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power ☆193 Updated last year
- Python wrapper for aspell (C extension and Python version) ☆81 Updated last year
- GermaNet API for Python ☆53 Updated 6 years ago
- Python bindings for libwapiti ☆66 Updated 4 years ago
- Cython wrapper on Hunspell Dictionary ☆65 Updated 4 months ago
- Text tokenization and sentence segmentation (segtok v2) ☆203 Updated 2 years ago