davidmogar / cucco
Text normalization library for Python
☆204Updated 6 years ago
Alternatives and similar repositories for cucco:
Users that are interested in cucco are comparing it to the libraries listed below
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆82Updated 8 years ago
- NLTK Contrib☆166Updated 11 months ago
- A toolkit for corpus linguistics☆200Updated 5 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 6 months ago
- Natural language processing using unsupervised vectors representation.☆106Updated 5 years ago
- Python wrapper for aspell (C extension and python version)☆81Updated last year
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- A Multilingual and Multilevel Representation Learning Toolkit for NLP☆116Updated 7 years ago
- Python tool for normilizing text and text canonicalization (DISCONTINUED)☆41Updated 11 years ago
- Goal: make Pattern compatible with Python 3.☆59Updated 4 years ago
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine☆186Updated 4 years ago
- 💫 Scripts, tools and resources for developing spaCy☆125Updated 5 years ago
- Python bindings to the Compact Language Detector☆33Updated 4 years ago
- A Python implementation of the Metaphone and Double Metaphone algorithms☆81Updated 11 months ago
- Get list of common stop words in various languages in Python☆155Updated 11 months ago
- Python bindings for libwapiti☆66Updated 5 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆169Updated 3 years ago
- A Python 3 phonetics library.☆126Updated 4 years ago
- Python Set subclass that supports searching by ngram similarity☆119Updated 3 years ago
- Python utilities for detecting textual reuse☆21Updated 9 years ago
- PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, an…☆478Updated last year
- CogComp's light-weight Python NLP annotators☆115Updated 6 years ago
- a Deep Learning based Speller☆225Updated 6 years ago
- Socially-Equitable Language Identification☆78Updated last year
- Python bindings for cld3☆27Updated last year
- Python wrapper for Stanford CoreNLP☆353Updated 4 years ago
- Python wrapper for Stanford CoreNLP tools☆58Updated 9 years ago
- Lightning Fast Language Prediction 🚀☆165Updated 5 years ago
- MIT Language Modeling Toolkit☆116Updated 5 years ago
- An extremely simple Python wrapper for the SRI Language Modeling toolkit☆70Updated 10 years ago