lecs-lab / polyglossLinks
A massively multilingual corpus and pretrained model for IGT
☆12Updated this week
Alternatives and similar repositories for polygloss
Users that are interested in polygloss are comparing it to the libraries listed below
Sorting:
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Updated 3 years ago
- A repository containing links to useful phonological software☆12Updated 2 years ago
- phone inventory library☆17Updated 2 years ago
- An R package for implementing and evaluating Maximum Entropy Optimality Theory models☆10Updated last week
- Scripts to create speech corpora from open.bible☆13Updated 4 years ago
- VoxAngeles Corpus☆13Updated 5 months ago
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Updated 4 years ago
- Proposed splits for the LREC Wikipron paper☆15Updated 5 years ago
- The grapheme to phoneme model converts Kazakh(Arab|Cyrillic) characters to phonemes.☆12Updated 6 years ago
- scipts for working with open.bible data☆26Updated 4 years ago
- Workflow for forced alignment between languages☆23Updated 3 weeks ago
- ☆48Updated 8 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆17Updated 2 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆34Updated 7 months ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11Updated 3 years ago
- Linguistic processing for Common Voice☆58Updated 2 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆54Updated 2 years ago
- Behavioral probing of language acquisition models at the lexical and syntactic level☆17Updated 2 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆46Updated 2 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Updated 5 years ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…☆36Updated 6 months ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆31Updated 2 years ago
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆43Updated 5 months ago
- Cross-Linguistic Transcription Systems☆17Updated last year
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Updated 3 years ago
- Simple Kaldi recipe for forced alignment☆11Updated 2 years ago
- Repository for multilingual speech data resources for native languages of Zambia.☆20Updated last year
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis…☆15Updated 3 years ago
- Deepspeech ASR Model for the Catalan Language☆17Updated 4 years ago
- This is a text-processing frontend that converts graphemes to phonemes and then further converts those phonemes into articulatory feature…☆14Updated last year