ftyers/commonvoice-utils

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ftyers/commonvoice-utils)

ftyers / commonvoice-utils

Linguistic processing for Common Voice

☆59

Alternatives and similar repositories for commonvoice-utils

Users that are interested in commonvoice-utils are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

coqui-ai / open-bible-scripts
View on GitHub
scipts for working with open.bible data
☆26Jan 24, 2022Updated 4 years ago
HarikalarKutusu / 3d-voice-chess
View on GitHub
A voice driven 3D chess game for learning Voice AI
☆17Jul 6, 2022Updated 4 years ago
common-voice / CorporaCreator
View on GitHub
Command line tool to create corpora for Common Voice
☆78Mar 25, 2026Updated 3 months ago
coqui-ai / data-checker
View on GitHub
🫠 check your data, before you wreck your model
☆16Aug 11, 2022Updated 3 years ago
KathyReid / opensource-voice-tools
View on GitHub
A repo listing known open source voice tools, ordered by where they sit in the voice stack
☆28Sep 23, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
thorstenMueller / cTTS
View on GitHub
TTS Client for Coqui TTS server
☆13Jan 7, 2023Updated 3 years ago
CohenPr-XPF / XPF
View on GitHub
☆39Feb 24, 2026Updated 4 months ago
JRMeyer / common-voice-forced-alignments
View on GitHub
Forced Alignments for Common Voice
☆33Oct 30, 2020Updated 5 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
Ardaq / kz_g2p
View on GitHub
The grapheme to phoneme model converts Kazakh(Arab|Cyrillic) characters to phonemes.
☆12Sep 30, 2019Updated 6 years ago
tiro-is / tiro-speech-core
View on GitHub
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Jun 19, 2023Updated 3 years ago
MohammedBelkacem / corpus-kab
View on GitHub
Tuddar, ismawen d imeḍqan
☆11Jan 3, 2020Updated 6 years ago
jhdeov / interlingual-MFA
View on GitHub
Workflow for forced alignment between languages
☆25May 7, 2026Updated 2 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
xinjli / transphone
View on GitHub
phoneme tokenizer and grapheme-to-phoneme model for 8k languages
☆174Jun 9, 2023Updated 3 years ago
common-voice / cv-dataset
View on GitHub
Metadata and versioning details for the Common Voice dataset
☆173Jun 16, 2026Updated last month
ductuantruong / speaker_age_estimation_ssl_study
View on GitHub
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14Oct 19, 2022Updated 3 years ago
domcross / german-stt-evaluation
View on GitHub
Evaluation of STT models for german language
☆16Jan 22, 2022Updated 4 years ago
alpoktem / bible2speechDB
View on GitHub
Scripts to create speech corpora from open.bible
☆13Jan 3, 2022Updated 4 years ago
gullabi / STT-align
View on GitHub
Coqui STT (🐸STT) based forced alignment tool
☆13Feb 24, 2022Updated 4 years ago
motazsaad / ara-pronunciation-tool
View on GitHub
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15Sep 5, 2017Updated 8 years ago
smaybius / Coqui-TTS-GUI-solution
View on GitHub
Interface for using TTS and vocoder models in the form of a text editor
☆20Nov 25, 2025Updated 7 months ago
mozilla / deepspeech-playbook
View on GitHub
DEPRECATED - A crash course for training speech recognition models using DeepSpeech.
☆24May 16, 2021Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
getalp / mass-dataset
View on GitHub
MaSS - Multilingual corpus of Sentence-aligned Spoken utterances
☆50Sep 16, 2024Updated last year
lstrgar / ss-phoneme-seg
View on GitHub
Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…
☆55Nov 4, 2022Updated 3 years ago
RuABraun / texterrors
View on GitHub
☆37Jun 9, 2026Updated last month
neulab / AfricanVoices
View on GitHub
Hosts text-to-speech corpus and speech synthesizers for African languages.
☆19May 31, 2023Updated 3 years ago
maxrmorrison / pypar
View on GitHub
Phoneme alignment representation compatible with multiple forced aligners
☆22Apr 12, 2024Updated 2 years ago
coqui-ai / inference-engine
View on GitHub
Coqui Inference Engine
☆41Aug 3, 2021Updated 4 years ago
AI-Lab-Makerere / Data4Good
View on GitHub
This repository contains publicly available speech and text data in Luganda.
☆12Sep 4, 2020Updated 5 years ago
NTRLab / MediaSpeech
View on GitHub
☆22Jul 22, 2022Updated 3 years ago
german-asr / megs
View on GitHub
A merged version of multiple open-source German speech datasets.
☆34May 3, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
gauthamsuresh09 / wav2vec2-large-xlsr-53-malayalam
View on GitHub
Wav2vec2 Large XLSR 53 fine-tuned for Malayalam
☆11Sep 7, 2021Updated 4 years ago
NeonGeckoCom / neon-tts-plugin-coqui
View on GitHub
Coqui AI TTS plugin
☆85Jul 2, 2025Updated last year
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
View on GitHub
☆12Jun 10, 2021Updated 5 years ago
coqui-ai / STT-models
View on GitHub
Open models for Coqui STT
☆153May 9, 2023Updated 3 years ago
uasolo / FDA-DH
View on GitHub
R Code recipes for Functional Data Analysis for phonetic analysis.
☆13Jul 31, 2024Updated last year
voidful / asrp
View on GitHub
ASR text preprocessing utility
☆21Aug 5, 2024Updated last year