mozilla/DSAlign

DeepSpeech based forced alignment tool

☆239

Alternatives and similar repositories for DSAlign

Users that are interested in DSAlign are comparing it to the libraries listed below.

pettarin / forced-alignment-tools
A collection of links and notes on forced alignment tools
☆941 · Apr 18, 2026 · Updated 3 months ago
lumaku / ctc-segmentation
Segment an audio file and obtain utterance alignments. (Python package)
☆348 · May 15, 2024 · Updated 2 years ago
strob / gentle
gentle forced aligner
☆1,703 · May 19, 2026 · Updated 2 months ago
talonvoice / wav2train
automatically align transcribed audio and generate a wav2letter training corpus
☆36 · Apr 11, 2023 · Updated 3 years ago
mlcommons / peoples-speech
The People’s Speech Dataset
☆115 · Jan 11, 2024 · Updated 2 years ago
MontrealCorpusTools / Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
☆1,847 · Jul 11, 2026 · Updated last week
coqui-ai / open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
☆1,397 · Jun 6, 2024 · Updated 2 years ago
coqui-ai / inference-engine
Coqui Inference Engine
☆41 · Aug 3, 2021 · Updated 4 years ago
amirharati / kaldi-alligner
scripts to align a given wave to its transcription using trained models by Kaldi
☆37 · Aug 15, 2019 · Updated 6 years ago
readbeyond / aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
☆2,852 · Jun 22, 2024 · Updated 2 years ago
gullabi / STT-align
Coqui STT (🐸STT) based forced alignment tool
☆13 · Feb 24, 2022 · Updated 4 years ago
awslabs / speech-representations
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
☆104 · Nov 26, 2022 · Updated 3 years ago
prosodylab / Prosodylab-Aligner
Python interface for forced audio alignment using HTK and SoX
☆351 · Jun 28, 2020 · Updated 6 years ago
bootphon / phonemizer
Simple text to phones converter for multiple languages
☆1,557 · Sep 26, 2024 · Updated last year
facebookresearch / WavAugment
A library for speech data augmentation in time-domain
☆689 · Aug 30, 2021 · Updated 4 years ago
kate-egorova / ASR-hybrid-decoding
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11 · Feb 4, 2020 · Updated 6 years ago
at16k / at16k
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
☆130 · Mar 31, 2021 · Updated 5 years ago
open-speech / speech-aligner
speech-aligner is a tool that generates phoneme-level time-alignment annotations from human speech and its corresponding text.
☆410 · Apr 8, 2020 · Updated 6 years ago
yoyolicoris / wavenet-like-vocoder
Basic wavenet and fftnet vocoder model.
☆19 · Feb 7, 2022 · Updated 4 years ago
freewym / espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
☆939 · Sep 4, 2024 · Updated last year
ucbvislab / p2fa-vislab
A script for audio/transcript alignment. Fork of p2fa.
☆69 · Mar 15, 2018 · Updated 8 years ago
ljuvela / GELP
☆27 · Apr 21, 2021 · Updated 5 years ago
alumae / streaming-punctuator
☆17 · Apr 14, 2023 · Updated 3 years ago
cornerfarmer / ctc_segmentation
Segment a given audio into utterances using a trained end-to-end ASR model.
☆75 · Oct 9, 2020 · Updated 5 years ago
BayesForDays / gently
Gentle and praatio scripts for easy forced alignment
☆18 · Oct 27, 2022 · Updated 3 years ago
lingjzhu / charsiu
Charsiu: A neural phonetic aligner.
☆346 · Sep 19, 2022 · Updated 3 years ago
desh2608 / kaldi-noise-vectors
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13 · Feb 13, 2021 · Updated 5 years ago
ivanvovk / WaveGrad
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
☆409 · Jul 7, 2021 · Updated 5 years ago
liusongxiang / efficient_tts
Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"
☆116 · Dec 22, 2021 · Updated 4 years ago
mozilla / voice-corpus-tool
Tool for creation, manipulation and maintenance of voice corpora
☆82 · May 3, 2024 · Updated 2 years ago
Kyubyong / g2p
g2p: English Grapheme To Phoneme Conversion
☆927 · Jan 5, 2023 · Updated 3 years ago
iisys-hof / HUI-Audio-Corpus-German
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…
☆35 · Mar 31, 2023 · Updated 3 years ago
dabinat / deepspeech-tools
Scripts to simplify data prepping for Mozilla DeepSpeech.
☆14 · Aug 6, 2019 · Updated 6 years ago
huiw39 / ExtensibleTTS-PyTorch
An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery
☆26 · Jun 24, 2019 · Updated 7 years ago
facebookresearch / vocoder-benchmark
A repository for benchmarking neural vocoders by their quality and speed.
☆213 · May 30, 2025 · Updated last year
thuhcsi / NeuFA
Neural network-based forced alignment with bidirectional attention mechanism
☆78 · Jan 17, 2025 · Updated last year
lmnt-com / wavegrad
A fast, high-quality neural vocoder.
☆299 · Jul 18, 2023 · Updated 3 years ago
candlewill / RawNet
RawNet: Fast End-to-End Neural Vocoder
☆43 · May 29, 2019 · Updated 7 years ago
Yangyangii / TPGST-Tacotron
Google's TPGST reimplementation.
☆34 · Dec 11, 2019 · Updated 6 years ago