antoinewdg / pyffsLinks
Python implementation of Leveshtein automata
☆25Updated 6 years ago
Alternatives and similar repositories for pyffs
Users that are interested in pyffs are comparing it to the libraries listed below
Sorting:
- Find parts of long text or data, allowing for some changes/typos.☆334Updated last month
- Textpipe: clean and extract metadata from text☆302Updated 4 years ago
- Jupyter Widget for data annotation☆141Updated 2 years ago
- Parse natural language time expressions in python☆131Updated 3 years ago
- NER, syntax markup visualizations☆140Updated 2 years ago
- ☆176Updated 8 months ago
- Text vectorization tool to outperform TFIDF for classification tasks☆195Updated 2 weeks ago
- LASER multilingual sentence embeddings as a pip package☆225Updated 2 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆68Updated 3 years ago
- Find strings/words in text; convenience and C speed☆127Updated 3 years ago
- Python wrapper for LanguageTool grammar checker☆329Updated 4 years ago
- Python Set subclass that supports searching by ngram similarity☆119Updated 4 years ago
- "Python Rule-based feAture sTructure Analysis" or "Python Rule-bAsed Text Analysis"☆70Updated 4 years ago
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine☆190Updated 4 years ago
- Fast multi-keyword search engine for text strings☆258Updated last year
- Python wrapper for aspell (C extension and python version)☆82Updated 2 years ago
- Text normalization library for Python☆203Updated 7 years ago
- Python bindings for cld3☆27Updated 2 years ago
- Abydos NLP/IR library for Python☆193Updated 3 years ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆250Updated 2 months ago
- A python module for word inflections designed for use with spaCy.☆93Updated 5 years ago
- Language independent truecaser in Python.☆159Updated 4 years ago
- Named Entity Recognition based on dictionaries☆242Updated 6 years ago
- Text tokenization and sentence segmentation (segtok v2)☆208Updated 3 years ago
- Python wrapper around SVDLIBC, a fast library for sparse Singular Value Decomposition☆55Updated 12 years ago
- A python module for English lemmatization and inflection.☆274Updated 2 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆378Updated 2 years ago
- Python wrapper for Stanford CoreNLP's SUTime☆162Updated 2 years ago
- ☆129Updated 4 years ago
- Python BK-tree data structure to allow fast querying of "close" matches☆186Updated 4 years ago