Linguistic processing for Common Voice
☆58Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for commonvoice-utils
Users that are interested in commonvoice-utils are comparing it to the libraries listed below
Sorting:
- scipts for working with open.bible data☆26Jan 24, 2022Updated 4 years ago
- A voice driven 3D chess game for learning Voice AI☆17Jul 6, 2022Updated 3 years ago
- Command line tool to create corpora for Common Voice☆78Feb 16, 2026Updated last month
- 🫠 check your data, before you wreck your model☆16Aug 11, 2022Updated 3 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆27Sep 23, 2022Updated 3 years ago
- ☆39Feb 24, 2026Updated 3 weeks ago
- TTS Client for Coqui TTS server☆13Jan 7, 2023Updated 3 years ago
- Forced Alignments for Common Voice☆33Oct 30, 2020Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- The grapheme to phoneme model converts Kazakh(Arab|Cyrillic) characters to phonemes.☆12Sep 30, 2019Updated 6 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- Metadata and versioning details for the Common Voice dataset☆168Mar 11, 2026Updated last week
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆174Jun 9, 2023Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- Workflow for forced alignment between languages☆24Jan 13, 2026Updated 2 months ago
- ☆37Nov 22, 2025Updated 3 months ago
- Tuddar, ismawen d imeḍqan☆10Jan 3, 2020Updated 6 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- Coqui STT (🐸STT) based forced alignment tool☆13Feb 24, 2022Updated 4 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Sep 5, 2017Updated 8 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆19Nov 25, 2025Updated 3 months ago
- A crash course for training speech recognition models using DeepSpeech.☆24May 16, 2021Updated 4 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Sep 16, 2024Updated last year
- Coqui Inference Engine☆40Aug 3, 2021Updated 4 years ago
- Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…☆55Nov 4, 2022Updated 3 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆18May 31, 2023Updated 2 years ago
- Phoneme alignment representation compatible with multiple forced aligners☆22Apr 12, 2024Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Dec 16, 2022Updated 3 years ago
- ☆20Jul 22, 2022Updated 3 years ago
- Coqui AI TTS plugin☆85Jul 2, 2025Updated 8 months ago
- A speech to text IBus engine using VOSK☆36Nov 6, 2022Updated 3 years ago
- PolyglotDB is a package for phonetic corpus storage and analysis☆51Jan 30, 2026Updated last month
- A merged version of multiple open-source German speech datasets.☆34May 3, 2024Updated last year
- Wav2vec2 Large XLSR 53 fine-tuned for Malayalam☆11Sep 7, 2021Updated 4 years ago
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…☆66Feb 26, 2024Updated 2 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆149Apr 5, 2024Updated last year