jacksonllee / iso639Links
ISO 639 language codes
☆47Updated 2 months ago
Alternatives and similar repositories for iso639
Users that are interested in iso639 are comparing it to the libraries listed below
Sorting:
- A Python library for working with and comparing language codes.☆351Updated 5 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆154Updated 2 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆76Updated last month
- Cython wrapper on Hunspell Dictionary☆66Updated last year
- Next-generation Punkt sentence boundary detection with zero dependencies☆20Updated 2 months ago
- Faster, modernized fork of the language identification tool langid.py☆59Updated 11 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆178Updated 4 months ago
- Python Finite-State Toolkit☆58Updated 2 weeks ago
- Confection: the sweetest config system for Python☆191Updated 6 months ago
- ☆174Updated 7 months ago
- Tool to fix bitexts and tag near-duplicates for removal☆33Updated last month
- A python module to reduce Unicode to a 'good enough' ASCII representation (outdated Github copy)☆41Updated 14 years ago
- Accurately find/replace/remove emojis in text strings☆162Updated last year
- Pythonic search engine based on PyLucene.☆130Updated 2 weeks ago
- Language detection using Spacy and Fasttext☆57Updated last year
- A python module for word inflections designed for use with spaCy.☆93Updated 5 years ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆110Updated 5 months ago
- Targetted language identifier, based on FastText and Hunspell.☆37Updated last month
- A python package to simulate typographical errors.☆38Updated last year
- Hy-phen-ation made easy☆215Updated 8 months ago
- Searching in-memory corpus with Corpus Query Language (CQL)☆19Updated 11 months ago
- Multilingual syllable annotation pipeline component for spacy☆39Updated 2 years ago
- A modern, interlingual wordnet interface for Python☆267Updated last month
- An open-source package for python to clean raw text data☆72Updated 2 years ago
- 📂 Additional lookup tables and data resources for spaCy☆112Updated 4 months ago
- Parse numbers written in natural language☆123Updated last year
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆75Updated 7 months ago
- A sentence segmentation library with wide language support optimized for speed and utility.☆68Updated this week
- fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-ha…☆41Updated 2 years ago
- Abydos NLP/IR library for Python☆191Updated 2 years ago