jacksonllee / iso639Links
ISO 639 language codes
☆47Updated 3 weeks ago
Alternatives and similar repositories for iso639
Users that are interested in iso639 are comparing it to the libraries listed below
Sorting:
- A Python library for working with and comparing language codes.☆346Updated 3 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆74Updated 2 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆154Updated 2 years ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆108Updated 3 months ago
- Pythonic search engine based on PyLucene.☆129Updated last week
- Accurately find/replace/remove emojis in text strings☆162Updated last year
- Cython wrapper on Hunspell Dictionary☆66Updated last year
- Python Finite-State Toolkit☆58Updated this week
- ☆173Updated 5 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆171Updated 2 months ago
- Faster, modernized fork of the language identification tool langid.py☆56Updated 9 months ago
- A sentence segmentation library with wide language support optimized for speed and utility.☆66Updated 2 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated 3 weeks ago
- Parse numbers written in natural language☆122Updated 10 months ago
- Hy-phen-ation made easy☆211Updated 6 months ago
- A python package to simulate typographical errors.☆37Updated last year
- Confection: the sweetest config system for Python☆188Updated 4 months ago
- fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-ha…☆40Updated 2 years ago
- ISO 639 library for Python☆34Updated last year
- A Python implementation of Lunr.js 🌖☆199Updated 5 months ago
- A python module to reduce Unicode to a 'good enough' ASCII representation (outdated Github copy)☆40Updated 14 years ago
- A python true casing utility that restores case information for texts☆89Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Tool to fix bitexts and tag near-duplicates for removal☆31Updated 6 months ago
- Abydos NLP/IR library for Python☆188Updated 2 years ago
- Convert number words (eg. twenty one) to numeric digits (21)☆178Updated 2 years ago
- ☆74Updated last week
- Language detection using Spacy and Fasttext☆57Updated last year
- 80x faster and 95% accurate language identification with Fasttext☆162Updated last year
- Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) dista…☆24Updated 2 months ago