jacksonllee / iso639Links
ISO 639 language codes
☆45Updated 4 months ago
Alternatives and similar repositories for iso639
Users that are interested in iso639 are comparing it to the libraries listed below
Sorting:
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Updated 5 years ago
- A python package to simulate typographical errors.☆35Updated last year
- Cython wrapper on Hunspell Dictionary☆66Updated 11 months ago
- Unicode Standard tokenization routines and orthography profile segmentation☆37Updated 4 months ago
- ☆15Updated 3 years ago
- Python Finite-State Toolkit☆56Updated last week
- Tool to fix bitexts and tag near-duplicates for removal☆30Updated 4 months ago
- Rust-based Python wrapper for duckling library in Haskell☆25Updated 4 years ago
- Gamma Agreement in Python☆44Updated last year
- A survey of corpora for Germanic low-resource languages and dialects☆25Updated 6 months ago
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection☆15Updated 3 years ago
- A flexible sentence segmentation library using CRF model and regex rules☆29Updated last year
- ☆22Updated 3 years ago
- Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.☆17Updated last year
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆107Updated 3 weeks ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆73Updated last month
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings☆86Updated last year
- Faster, modernized fork of the language identification tool langid.py☆56Updated 7 months ago
- Suite for phonetic word embeddings, especially their evaluation and baseline models.☆28Updated 3 months ago
- Fast syllable estimation library based on pattern matching.☆39Updated 3 months ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆82Updated 9 months ago
- A library for data streaming and augmentation☆20Updated last month
- Transform TMX to text☆27Updated 2 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated 2 months ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated last year
- A fast, comprehensive, ISO 639 library.☆39Updated 4 months ago
- python package for calculating famous measures in computational linguistics☆14Updated 7 months ago
- universal syllabification algorithms☆44Updated 2 years ago
- fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-ha…☆39Updated 2 years ago
- bin files☆13Updated 4 months ago