antoinewdg / pyffs
Python implementation of Levenshtein automata
☆25 Updated 6 years ago
Alternatives and similar repositories for pyffs
Users that are interested in pyffs are comparing it to the libraries listed below
- Parse natural language time expressions in python ☆131 Updated 3 years ago
- Python wrapper for aspell (C extension and python version) ☆82 Updated 2 years ago
- A fast, simple, multilingual tokenizer ☆29 Updated 8 years ago
- NER, syntax markup visualizations ☆140 Updated 2 years ago
- A small tool that EXPLains spACY parse results. See what I did there? ☆84 Updated 3 years ago
- Python wrapper for LanguageTool grammar checker ☆329 Updated 4 years ago
- A python module for word inflections designed for use with spaCy. ☆93 Updated 6 years ago
- Jupyter Widget for data annotation ☆140 Updated 3 years ago
- Python Set subclass that supports searching by ngram similarity ☆119 Updated 4 years ago
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine ☆189 Updated 5 years ago
- Fast, DB Backed pretrained word embeddings for natural language processing. ☆224 Updated 10 months ago
- Textpipe: clean and extract metadata from text ☆302 Updated 4 years ago
- LASER multilingual sentence embeddings as a pip package ☆224 Updated 2 years ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance. ☆250 Updated 4 months ago
- Automatic extraction of edited sentences from text edition histories. ☆83 Updated 3 years ago
- Fast multi-keyword search engine for text strings ☆258 Updated last year
- English word segmentation, written in pure-Python, and based on a trillion-word corpus. ☆378 Updated 3 years ago
- Text vectorization tool to outperform TFIDF for classification tasks ☆197 Updated 2 months ago
- Find parts of long text or data, allowing for some changes/typos. ☆339 Updated 3 months ago
- Extension of scikit-learn TfidfVectorizer and CountVectorizer that allows for online learning / partial fit. ☆34 Updated 8 years ago
- Language independent truecaser in Python. ☆160 Updated 4 years ago
- Text normalization library for Python ☆202 Updated 7 years ago
- ☆178 Updated 10 months ago
- A compound word splitter for Python ☆49 Updated 4 years ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict). ☆67 Updated last week
- A simple fuzzy matching set for python strings ☆230 Updated last year
- "Python Rule-based feAture sTructure Analysis" or "Python Rule-bAsed Text Analysis" ☆70 Updated 4 years ago
- Python BK-tree data structure to allow fast querying of "close" matches ☆187 Updated 4 years ago
- Python 3 Spelling Corrector ☆178 Updated 2 years ago
- Text tokenization and sentence segmentation (segtok v2) ☆208 Updated 3 years ago