Kozea / PyphenLinks
Hy-phen-ation made easy
☆210Updated 4 months ago
Alternatives and similar repositories for Pyphen
Users that are interested in Pyphen are comparing it to the libraries listed below
Sorting:
- The PyICU project repository has moved to https://pyicu.org.☆133Updated 4 years ago
- A Python library for working with and comparing language codes.☆345Updated last month
- Cython wrapper on Hunspell Dictionary☆66Updated 11 months ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆63Updated 2 months ago
- ISO 639 library for Python☆33Updated 9 months ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆52Updated 3 years ago
- A python module to reduce Unicode to a 'good enough' ASCII representation (outdated Github copy)☆40Updated 14 years ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆107Updated 3 weeks ago
- unihandecode is a transliteration library to convert all characters/words in Unicode into ASCII alphabet that aware with Language prefere…☆68Updated 2 years ago
- Text normalization library for Python☆204Updated 7 years ago
- Automatically exported from code.google.com/p/foma☆122Updated 4 months ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆152Updated 5 months ago
- Parse numbers written in natural language☆117Updated 8 months ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆194Updated 4 years ago
- Python package for Google's diff-match-patch native C++ implementation.☆79Updated last year
- A pure Python Levenshtein implementation that's not freaking GPL'd.☆97Updated 2 years ago
- Multilingual syllable annotation pipeline component for spacy☆39Updated 2 years ago
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆75Updated 3 weeks ago
- A Python port of the Apache Lucene ASCII Folding Filter that converts alphabetic, numeric, and symbolic Unicode characters which are not …☆15Updated 5 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆170Updated 3 years ago
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine☆186Updated 4 years ago
- A python package for grapheme aware string handling☆112Updated 3 years ago
- a Python implementation of the Unicode Collation Algorithm☆220Updated last year
- LingPy: Python library for quantitative tasks in historical linguistics☆134Updated 3 months ago
- Abydos NLP/IR library for Python☆186Updated 2 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 10 months ago
- Python wrapper for aspell (C extension and python version)☆82Updated last year
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆373Updated 2 years ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆66Updated 2 years ago
- A compound word splitter for Python☆48Updated 3 years ago