jacksonllee / iso639Links
ISO 639 language codes
☆47Updated last month
Alternatives and similar repositories for iso639
Users that are interested in iso639 are comparing it to the libraries listed below
Sorting:
- A Python library for working with and comparing language codes.☆350Updated 4 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆75Updated 2 weeks ago
- A python package to simulate typographical errors.☆37Updated last year
- Accurately find/replace/remove emojis in text strings☆162Updated last year
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆154Updated 2 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated last month
- Python Finite-State Toolkit☆58Updated this week
- A sentence segmentation library with wide language support optimized for speed and utility.☆67Updated 2 months ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆108Updated 3 months ago
- This is a simple Python package for calculating a variety of lexical diversity indices☆79Updated 2 years ago
- Cython wrapper on Hunspell Dictionary☆66Updated last year
- Pythonic search engine based on PyLucene.☆130Updated last month
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆173Updated 3 months ago
- Confection: the sweetest config system for Python☆190Updated 5 months ago
- A python module for word inflections designed for use with spaCy.☆93Updated 5 years ago
- Faster, modernized fork of the language identification tool langid.py☆57Updated 10 months ago
- Tool to fix bitexts and tag near-duplicates for removal☆31Updated 2 weeks ago
- an experimental implementation of Burrow's delta in Python 3☆21Updated 3 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆51Updated 2 months ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆74Updated 5 months ago
- fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-ha…☆40Updated 2 years ago
- ☆173Updated 5 months ago
- Targetted language identifier, based on FastText and Hunspell.☆37Updated 2 weeks ago
- Tool for the Automatic Analysis of Syntactic Sophistication and Complexity☆26Updated last year
- OpusFilter - Parallel corpus processing toolkit☆109Updated last month
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals …☆15Updated last year
- Multilingual syllable annotation pipeline component for spacy☆39Updated 2 years ago
- Searching in-memory corpus with Corpus Query Language (CQL)☆19Updated 9 months ago
- Transform TMX to text☆27Updated 2 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆151Updated 2 weeks ago