arahusky / diacritics_restoration
Neural-based model for automatic diacritics restoration.
☆25 Updated 6 years ago
Alternatives and similar repositories for diacritics_restoration
Users who are interested in diacritics_restoration compare it to the repositories listed below.
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning ☆33 Updated last week
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/… ☆35 Updated last week
- phone inventory library ☆16 Updated 2 years ago
- VoxAngeles Corpus ☆12 Updated last year
- ☆10 Updated 4 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW… ☆18 Updated 2 years ago
- IPA tokeniser ☆18 Updated 2 weeks ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation ☆26 Updated 2 years ago
- Deepspeech ASR Model for the Catalan Language ☆17 Updated 4 years ago
- ☆22 Updated 3 years ago
- SubER - Subtitle Edit Rate ☆22 Updated 3 months ago
- Calculates the Word Error Rate between two text files ☆20 Updated 2 years ago
- Proposed splits for the LREC Wikipron paper ☆14 Updated 5 years ago
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars ☆17 Updated last year
- Morphological Inflection for Low-Resource Languages using cross-lingual transfer ☆20 Updated 5 years ago
- Python Finite-State Toolkit ☆57 Updated last week
- Gamma Agreement in Python ☆45 Updated last year
- Repository for multilingual speech data resources for native languages of Zambia. ☆18 Updated 10 months ago
- ☆18 Updated 3 years ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format ☆33 Updated 6 years ago
- linguistic data on the Yongning Na language ☆8 Updated last month
- Scripts to create speech corpora from open.bible ☆13 Updated 3 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2 ☆114 Updated 6 years ago
- Simple Kaldi recipe for forced alignment ☆10 Updated 2 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court ☆22 Updated 2 years ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github… ☆12 Updated 2 years ago
- Compound splitter for German ☆108 Updated 5 years ago
- Audiobook alignment for Indigenous languages ☆40 Updated 2 weeks ago
- ☆14 Updated 2 years ago
- ☆23 Updated 3 years ago