Linguistic processing for Common Voice
β58Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for commonvoice-utils
Users that are interested in commonvoice-utils are comparing it to the libraries listed below
Sorting:
- scipts for working with open.bible dataβ26Jan 24, 2022Updated 4 years ago
- π« check your data, before you wreck your modelβ16Aug 11, 2022Updated 3 years ago
- A voice driven 3D chess game for learning Voice AIβ17Jul 6, 2022Updated 3 years ago
- Command line tool to create corpora for Common Voiceβ78Feb 16, 2026Updated last week
- β39Feb 2, 2026Updated 3 weeks ago
- The grapheme to phoneme model converts Kazakh(Arab|Cyrillic) characters to phonemes.β12Sep 30, 2019Updated 6 years ago
- Using YouTube to prepare a speech recognition dataset for any languageβ10Mar 30, 2021Updated 4 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challengeβ15Mar 26, 2022Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.β13Feb 13, 2021Updated 5 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languagesβ174Jun 9, 2023Updated 2 years ago
- TTS Client for Coqui TTS serverβ13Jan 7, 2023Updated 3 years ago
- Phoneme segmentation using pre-trained speech modelsβ55Nov 4, 2022Updated 3 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stackβ27Sep 23, 2022Updated 3 years ago
- Workflow for forced alignment between languagesβ23Jan 13, 2026Updated last month
- Coqui STT (πΈSTT) based forced alignment toolβ13Feb 24, 2022Updated 4 years ago
- Scripts to create speech corpora from open.bibleβ13Jan 3, 2022Updated 4 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based β¦β16Sep 5, 2017Updated 8 years ago
- β37Nov 22, 2025Updated 3 months ago
- β12Jun 10, 2021Updated 4 years ago
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language Mβ¦β20Jan 3, 2023Updated 3 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-coreβ15Jun 19, 2023Updated 2 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Modelsβ14Oct 19, 2022Updated 3 years ago
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. Oβ¦β66Feb 26, 2024Updated 2 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.β18May 31, 2023Updated 2 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpusβ185Dec 6, 2024Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognitionβ28Dec 16, 2022Updated 3 years ago
- Metadata and versioning details for the Common Voice datasetβ166Feb 16, 2026Updated last week
- β20Jul 22, 2022Updated 3 years ago
- Forced Alignments for Common Voiceβ32Oct 30, 2020Updated 5 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic biasβ20Jul 21, 2020Updated 5 years ago
- ASR text preprocessing utilityβ21Aug 5, 2024Updated last year
- Coqui AI TTS pluginβ85Jul 2, 2025Updated 7 months ago
- Properly handle position-dependent phones in a subword lexicon FSTβ31Oct 26, 2020Updated 5 years ago
- Coqui Inference Engineβ40Aug 3, 2021Updated 4 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languagesβ147Apr 5, 2024Updated last year
- Breaks a word into syllables using an LSTM-based neural network.β20Aug 14, 2023Updated 2 years ago
- A handy dataset of noises for ASRβ22May 29, 2019Updated 6 years ago
- Phoneme alignment representation compatible with multiple forced alignersβ22Apr 12, 2024Updated last year
- Phonetically-Oriented Word Error Rateβ36May 4, 2019Updated 6 years ago