infoscout / weighted-levenshtein
Weighted Levenshtein library
☆105Updated last year
Related projects ⓘ
Alternatives and complementary repositories for weighted-levenshtein
- Hidden alignment conditional random field for classifying string pairs.☆37Updated 7 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆170Updated 2 years ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆243Updated 6 months ago
- Language independent truecaser in Python.☆161Updated 3 years ago
- A Python 3 phonetics library.☆124Updated 4 years ago
- Language detection extension for spaCy 2.0+☆111Updated 5 years ago
- Parse natural language time expressions in python☆131Updated last year
- Cython wrapper on Hunspell Dictionary☆65Updated 4 months ago
- Character-based word embeddings model based on RNN for handling real world texts☆172Updated last year
- ☆165Updated 5 months ago
- General-Purpose Neural Networks for Sentence Boundary Detection☆73Updated last year
- Fast, DB Backed pretrained word embeddings for natural language processing.☆223Updated last year
- Generic Environment for Context-Aware Correction of Orthography☆22Updated 2 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 3 months ago
- Python wrapper for aspell (C extension and python version)☆81Updated last year
- Abydos NLP/IR library for Python☆183Updated 2 years ago
- Fast Word Clustering Software☆74Updated 2 months ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆149Updated last year
- Labeled examples from wiki dumps in Python☆68Updated 8 years ago
- Fast multi-keyword search engine for text strings☆247Updated last month
- spaCy + UDPipe☆161Updated 2 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆365Updated last year
- Lightning Fast Language Prediction 🚀☆165Updated 5 years ago
- Python bindings for libwapiti☆66Updated 4 years ago
- Compound splitter for German☆103Updated 4 years ago
- A compound word splitter for Python☆48Updated 3 years ago
- Fast supervised sentence boundary detection using the averaged perceptron☆90Updated 5 years ago
- 💥 Cython hash tables that assume keys are pre-hashed☆82Updated last year
- A fully customisable language detection pipeline for spaCy☆93Updated 5 years ago