Kozea / Pyphen
Hy-phen-ation made easy
☆202Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for Pyphen
- A Python library for working with and comparing language codes.☆339Updated 7 months ago
- The PyICU project repository has moved to https://pyicu.org.☆133Updated 3 years ago
- A pure Python Levenshtein implementation that's not freaking GPL'd.☆97Updated last year
- Parse natural language time expressions in python☆131Updated last year
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆52Updated 3 years ago
- Multilingual syllable annotation pipeline component for spacy☆37Updated last year
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆144Updated 10 months ago
- ISO 639 library for Python☆32Updated 2 months ago
- Cython wrapper on Hunspell Dictionary☆65Updated 4 months ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆27Updated 5 years ago
- ASCII transliterations of Unicode text - GitHub mirror☆531Updated 6 months ago
- Python wrapper for aspell (C extension and python version)☆81Updated last year
- unihandecode is a transliteration library to convert all characters/words in Unicode into ASCII alphabet that aware with Language prefere…☆69Updated 2 years ago
- A python package for grapheme aware string handling☆108Updated 2 years ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆64Updated last year
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆77Updated 3 years ago
- linguistics tree drawing to SVG in python, aimed at Jupyter☆62Updated 3 months ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆103Updated 2 weeks ago
- Complete lxml external type annotation☆40Updated last week
- A Python module to discover the etymology of words☆145Updated 6 months ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆62Updated 2 months ago
- Language detection extension for spaCy 2.0+☆111Updated 5 years ago
- NLTK Contrib☆166Updated 8 months ago
- Abydos NLP/IR library for Python☆183Updated 2 years ago
- Text tokenization and sentence segmentation (segtok v2)☆203Updated 2 years ago
- Python package for Google's diff-match-patch native C++ implementation.☆73Updated 5 months ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆365Updated last year
- A simple fuzzy matching set for python strings☆223Updated 3 months ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆170Updated 2 years ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆149Updated last year