rspeer / langcodes
A Python library for working with and comparing language codes.
☆346 Updated 4 months ago
Alternatives and similar repositories for langcodes:
Users who are interested in langcodes are comparing it to the libraries listed below.
- ASCII transliterations of Unicode text - GitHub mirror ☆560 Updated last week
- Hy-phen-ation made easy ☆212 Updated 2 months ago
- A python module to reduce Unicode to a 'good enough' ASCII representation (outdated Github copy) ☆40 Updated 14 years ago
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆151 Updated last year
- ☆169 Updated last month
- A pure Python Levenshtein implementation that's not freaking GPL'd. ☆97 Updated 2 years ago
- ISO 639 library for Python ☆32 Updated 8 months ago
- Cython wrapper on Hunspell Dictionary ☆66 Updated 10 months ago
- universal character encoding detector ☆400 Updated 4 months ago
- A python package for grapheme aware string handling ☆112 Updated 3 years ago
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text. ☆52 Updated 4 months ago
- The PyICU project repository has moved to https://pyicu.org. ☆133 Updated 4 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing ☆71 Updated this week
- ISO 639 language codes ☆44 Updated 2 months ago
- Python wrapper for RE2 ☆103 Updated 3 weeks ago
- Abydos NLP/IR library for Python ☆185 Updated 2 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus. ☆374 Updated 2 years ago
- URL normalization for Python ☆94 Updated last week
- python library to simplify working with jsonlines and ndjson data ☆293 Updated 9 months ago
- Truly universal encoding detector in pure Python ☆644 Updated this week
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want) ☆66 Updated 2 years ago
- Fast Python Bloom Filter using Mmap ☆127 Updated 11 months ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe… ☆169 Updated 3 years ago
- Python stemming library using snowball stemmers ☆255 Updated 6 months ago
- Levenshtein and Hamming distance computation ☆116 Updated 5 years ago
- A Python implementation of Lunr.js 🌖 ☆195 Updated last month
- Pythonic search engine based on PyLucene. ☆126 Updated 5 months ago
- unihandecode is a transliteration library to convert all characters/words in Unicode into ASCII alphabet that aware with Language prefere… ☆69 Updated 2 years ago
- Parse natural language time expressions in python ☆130 Updated 2 years ago
- Lightning Fast Language Prediction 🚀 ☆166 Updated 6 years ago