rspeer / langcodes
A Python library for working with and comparing language codes.
☆342Updated 2 months ago
Alternatives and similar repositories for langcodes:
Users that are interested in langcodes are comparing it to the libraries listed below
- ASCII transliterations of Unicode text - GitHub mirror☆542Updated 9 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆149Updated last year
- Hy-phen-ation made easy☆207Updated 3 weeks ago
- ISO 639 library for Python☆32Updated 5 months ago
- ☆167Updated 8 months ago
- A python package for grapheme aware string handling☆110Updated 2 years ago
- Cython wrapper on Hunspell Dictionary☆66Updated 7 months ago
- A pure Python Levenshtein implementation that's not freaking GPL'd.☆96Updated last year
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆65Updated last year
- A fully customisable language detection pipeline for spaCy☆92Updated 5 years ago
- ☆481Updated this week
- The PyICU project repository has moved to https://pyicu.org.☆133Updated 3 years ago
- A Python implementation of Lunr.js 🌖☆196Updated last month
- ISO 639 language codes☆39Updated this week
- URL normalization for Python☆94Updated 2 years ago
- Parse numbers written in natural language☆109Updated 3 months ago
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to words☆1,000Updated last month
- Hunspell extension for spaCy 2.0.☆94Updated 6 months ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆170Updated 3 years ago
- Text tokenization and sentence segmentation (segtok v2)☆202Updated 2 years ago
- unihandecode is a transliteration library to convert all characters/words in Unicode into ASCII alphabet that aware with Language prefere…☆69Updated 2 years ago
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.☆51Updated last month
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆296Updated last month
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆449Updated 3 weeks ago
- A python module to reduce Unicode to a 'good enough' ASCII representation (outdated Github copy)☆38Updated 14 years ago
- Lightning Fast Language Prediction 🚀☆165Updated 5 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆151Updated 2 months ago
- A Python module that tries to figure out what your local timezone is☆191Updated this week
- Python stemming library using snowball stemmers☆249Updated 4 months ago
- A compound word splitter for Python☆48Updated 3 years ago