kariminf / lang-trans
Python transliteration library (mostly from non-latin scripts, such as Arabic, Japanese, etc.)
☆20Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for lang-trans
- ☆22Updated 2 years ago
- Pronounce Arabic words☆18Updated 5 years ago
- Arabic Phonetic Dictionary Generator Tool for Automatic Speech Recognition Applications☆13Updated 3 years ago
- Benchmark Arabic text diacritization dataset☆71Updated 5 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆24Updated last year
- Includes an Arabic diacritizer, IPA converter, and arabic-only filter☆14Updated 7 years ago
- Convert Arabic diacritised text to a sequence of phonemes and create a pronunciation dictionary from them for alignment using HTK☆58Updated 7 years ago
- Country-level Arabic dialect identification (17 Arabic countries)☆43Updated 4 years ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis…☆14Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆15Updated last year
- phone inventory library☆15Updated last year
- This is a diacritization model for Arabic language. This model was built/trained using the Tashkeela: the Arabic diacritization corpus on…☆38Updated last year
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆37Updated last year
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆37Updated last year
- IPA tokeniser☆16Updated 7 months ago
- This repository☆30Updated 2 years ago
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Updated 3 years ago
- Morfessor EM+Prune☆10Updated 4 years ago
- A repository for the 2022 Inflection Shared Task☆9Updated 2 years ago
- Phonetically-Oriented Word Error Rate☆33Updated 5 years ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆80Updated last year
- ☆19Updated 3 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Updated 7 years ago
- Automatic Dialect Detection Repository☆39Updated 2 years ago
- Simple Kaldi recipe for forced alignment☆10Updated last year
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆36Updated last year
- An adaptation of Fairseq to (End-to-end) speech translation.☆22Updated 2 years ago
- Python Finite-State Toolkit☆45Updated last week
- Python module for syllabifying English ARPABET transcriptions☆64Updated 5 years ago