JRMeyer/common-voice-forced-alignments

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JRMeyer/common-voice-forced-alignments)

JRMeyer / common-voice-forced-alignments

Forced Alignments for Common Voice

☆33

Alternatives and similar repositories for common-voice-forced-alignments

Users that are interested in common-voice-forced-alignments are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Digital-Umuganda / Deepspeech-Kinyarwanda
View on GitHub
The kinyarwanda model for deepspeech
☆17May 11, 2021Updated 5 years ago
ftyers / ud-scripts
View on GitHub
Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies
☆17Mar 4, 2020Updated 6 years ago
kdawson2 / tshape_analysis
View on GitHub
Code designed for analysis of tongue contour data - produces three metrics (Procrustes analysis, Modified Curvature Index and Fourier ana…
☆10Apr 19, 2024Updated 2 years ago
ftyers / commonvoice-utils
View on GitHub
Linguistic processing for Common Voice
☆59Jan 18, 2024Updated 2 years ago
kamperh / globalphone_awe
View on GitHub
Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.
☆11Nov 3, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
clld / grambank
View on GitHub
☆13Jul 8, 2026Updated 3 weeks ago
harvard-edge / multilingual_kws
View on GitHub
Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
☆190Dec 6, 2024Updated last year
sil-ai / tts-singlish
View on GitHub
TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.
☆11Jan 11, 2020Updated 6 years ago
rhasspy / rhasspy-asr-kaldi
View on GitHub
Speech to text library for Rhasspy using Kaldi
☆15Dec 9, 2023Updated 2 years ago
revdotcom / words2num
View on GitHub
Convert words to numbers
☆21Apr 13, 2022Updated 4 years ago
CUNY-CL / wikipron
View on GitHub
Massively multilingual pronunciation mining
☆371Updated this week
athena-team / athena-transform
View on GitHub
☆21Jan 13, 2020Updated 6 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
jhdeov / interlingual-MFA
View on GitHub
Workflow for forced alignment between languages
☆25May 7, 2026Updated 2 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
mktiede / GetContours
View on GitHub
Matlab tool for interactively extracting tongue contours from Ultrasound movie or DICOM sequences
☆17Apr 30, 2021Updated 5 years ago
ArchitParnami / Few-Shot-KWS
View on GitHub
Few-Shot Keyword Spotting
☆73Apr 11, 2021Updated 5 years ago
gre / zpeech
View on GitHub
ZPeech, vowel formant analysis experiment with Web Audio API
☆23Mar 24, 2014Updated 12 years ago
NickRuiz / power-asr
View on GitHub
Phonetically-Oriented Word Error Rate
☆36May 4, 2019Updated 7 years ago
linguisticexplorer / Linguistic-Explorer
View on GitHub
Terraling is a Ruby on Rails web application to let you store and browse your linguistic data. For More information read the README file.
☆17Nov 8, 2019Updated 6 years ago
ffxiong / uaspeech
View on GitHub
Baseline kaldi script for UA-SPEECH corpus
☆32Oct 16, 2024Updated last year
rhoposit / icassp2021
View on GitHub
☆15May 8, 2021Updated 5 years ago
jhdeov / ArmenianVerbs
View on GitHub
Paradigms of Armenian conjugation classes, and sample verb list
☆17Apr 13, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
awasthiabhijeet / Error-Driven-ASR-Personalization
View on GitHub
Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021
☆11Jun 13, 2021Updated 5 years ago
iisys-hof / olaph
View on GitHub
OLaPh (Optimal Language Phonemizer) is a multilingual phonemization framework that converts text into phonemes surpassing the quality of …
☆17Jul 20, 2026Updated last week
google-research / last
View on GitHub
A JAX library for building lattice-based speech transducer models
☆48Jul 2, 2026Updated 3 weeks ago
JRMeyer / easy-kaldi
View on GitHub
Use your data to create a speech recognition system in Kaldi. Fast.
☆65Jan 2, 2020Updated 6 years ago
titu1994 / keras-normalized-optimizers
View on GitHub
Wrapper for Normalized Gradient Descent in Keras
☆17Jun 9, 2018Updated 8 years ago
MontrealCorpusTools / kalpy
View on GitHub
Pybind11 bindings for Kaldi
☆15Jul 11, 2026Updated 2 weeks ago
coqui-ai / snakepit
View on GitHub
🐍 Coqui's machine learning job scheduler
☆31Sep 5, 2021Updated 4 years ago
gbegus / DeepPhonologyTool
View on GitHub
Train a fiwGAN or ciwGAN model using your own training data
☆14Oct 13, 2022Updated 3 years ago
usc-sail / peft-ser
View on GitHub
[ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…
☆60Jul 1, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Anrijs / Aranet4-ESP32
View on GitHub
Aranet4 ESP32 client
☆22Jul 19, 2025Updated last year
pavelsof / ipatok
View on GitHub
IPA tokeniser
☆19Jul 28, 2025Updated last year
kamperh / speech_dtw
View on GitHub
Dynamic time warping (DTW) functions for specifically speech alignment.
☆30May 6, 2024Updated 2 years ago
anzeyimana / DeepKIN
View on GitHub
DeepKIN -- A deep learning toolkit for Kinyarwanda NLP.
☆14Jun 4, 2025Updated last year
abbrev / tascam-rc-10-remote
View on GitHub
TASCAM RC-10 remote control
☆20Oct 22, 2016Updated 9 years ago
lingjzhu / zipa
View on GitHub
A family of efficient speech models for multilingual phone recognition
☆68Jul 18, 2026Updated last week
evuraan / mintPiper
View on GitHub
Make Linux speak what's on the screen: clearly and securely.
☆35Apr 6, 2024Updated 2 years ago