takluyver / UnidecodeLinks
A python module to reduce Unicode to a 'good enough' ASCII representation (outdated Github copy)
☆42Updated 14 years ago
Alternatives and similar repositories for Unidecode
Users that are interested in Unidecode are comparing it to the libraries listed below
Sorting:
- A Python library for working with and comparing language codes.☆353Updated 7 months ago
- ASCII transliterations of Unicode text - GitHub mirror☆594Updated 3 months ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆65Updated 2 weeks ago
- Hy-phen-ation made easy☆217Updated 9 months ago
- Cython wrapper on Hunspell Dictionary☆66Updated last year
- unihandecode is a transliteration library to convert all characters/words in Unicode into ASCII alphabet that aware with Language prefere…☆68Updated 3 years ago
- ISO 639 library for Python☆35Updated last year
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆155Updated 2 years ago
- Read, write, convert and segment WebVTT caption files in Python.☆227Updated last year
- ☆176Updated 8 months ago
- Fast syllable estimation library based on pattern matching.☆40Updated 2 weeks ago
- ☆564Updated last month
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆111Updated 6 months ago
- Python module that identifies Chinese text as being Simplified or Traditional☆105Updated last year
- Truly universal encoding detector in pure Python.☆722Updated 2 weeks ago
- A python package for grapheme aware string handling☆114Updated 3 years ago
- Find parts of long text or data, allowing for some changes/typos.☆334Updated last month
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆121Updated last month
- Parse numbers written in natural language☆124Updated last year
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine☆190Updated 4 years ago
- Python wrapper for aspell (C extension and python version)☆82Updated 2 years ago
- Faster, modernized fork of the language identification tool langid.py☆61Updated last year
- ISO 639 language codes☆49Updated last month
- A Python 3 phonetics library.☆136Updated 5 years ago
- A mutable set that remembers the order of its entries. One of Python's missing data types.☆224Updated last year
- Bi-directional transliterator for Python. Transliterates (unicode) strings according to the rules specified in the language packs.☆309Updated 2 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆180Updated 6 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆77Updated 2 weeks ago
- Convert number words (eg. twenty one) to numeric digits (21)☆180Updated 2 years ago
- A simple immutable mapping for python☆117Updated 3 years ago