strob/gentle

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/strob/gentle)

strob / gentle

gentle forced aligner

☆1,703

Alternatives and similar repositories for gentle

Users that are interested in gentle are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pettarin / forced-alignment-tools
View on GitHub
A collection of links and notes on forced alignment tools
☆941Apr 18, 2026Updated 3 months ago
MontrealCorpusTools / Montreal-Forced-Aligner
View on GitHub
Command line utility for forced alignment using Kaldi
☆1,847Jul 11, 2026Updated last week
readbeyond / aeneas
View on GitHub
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
☆2,852Jun 22, 2024Updated 2 years ago
mozilla / DSAlign
View on GitHub
DeepSpeech based forced alignment tool
☆239Dec 12, 2020Updated 5 years ago
prosodylab / Prosodylab-Aligner
View on GitHub
Python interface for forced audio alignment using HTK and SoX
☆351Jun 28, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
nassosoassos / sail_align
View on GitHub
SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition…
☆99Apr 5, 2022Updated 4 years ago
coqui-ai / open-speech-corpora
View on GitHub
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
☆1,397Jun 6, 2024Updated 2 years ago
AdolfVonKleist / Phonetisaurus
View on GitHub
Phonetisaurus G2P
☆516Jun 1, 2024Updated 2 years ago
kaldi-asr / kaldi
View on GitHub
kaldi-asr/kaldi is the official location of the Kaldi project.
☆15,432Sep 22, 2025Updated 9 months ago
lingjzhu / charsiu
View on GitHub
Charsiu: A neural phonetic aligner.
☆346Sep 19, 2022Updated 3 years ago
syhw / wer_are_we
View on GitHub
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
☆1,864Jun 27, 2022Updated 4 years ago
Kyubyong / g2p
View on GitHub
g2p: English Grapheme To Phoneme Conversion
☆927Jan 5, 2023Updated 3 years ago
bootphon / phonemizer
View on GitHub
Simple text to phones converter for multiple languages
☆1,557Sep 26, 2024Updated last year
lumaku / ctc-segmentation
View on GitHub
Segment an audio file and obtain utterance alignments. (Python package)
☆348May 15, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Helsinki-NLP / prosody
View on GitHub
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
☆249Oct 30, 2019Updated 6 years ago
facebookresearch / WavAugment
View on GitHub
A library for speech data augmentation in time-domain
☆689Aug 30, 2021Updated 4 years ago
xcmyz / FastSpeech
View on GitHub
The Implementation of FastSpeech based on pytorch.
☆885Jul 6, 2023Updated 3 years ago
gooofy / zamia-speech
View on GitHub
Open tools and data for cloudless automatic speech recognition
☆449Mar 30, 2021Updated 5 years ago
espnet / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆9,897Updated this week
NVIDIA / mellotron
View on GitHub
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…
☆870Jul 22, 2023Updated 2 years ago
YannickJadoul / Parselmouth
View on GitHub
Praat in Python, the Pythonic way
☆1,272Jun 23, 2026Updated 3 weeks ago
lhotse-speech / lhotse
View on GitHub
Tools for handling multimodal data in machine learning projects.
☆1,143Jun 22, 2026Updated 3 weeks ago
Kyubyong / css10
View on GitHub
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
☆490Mar 6, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
MontrealCorpusTools / MFA-reorganization-scripts
View on GitHub
Collection of scripts and utilities for reorganizing corpora to use with the Montreal Forced Aligner
☆43Jun 22, 2021Updated 5 years ago
pykaldi / pykaldi
View on GitHub
A Python wrapper for Kaldi
☆1,038Nov 30, 2025Updated 7 months ago
jfsantos / maracas
View on GitHub
maracas is a library for corrupting audio files with additive and convolutive noise.
☆72Aug 22, 2017Updated 8 years ago
YoavRamon / awesome-kaldi
View on GitHub
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
☆536Feb 9, 2022Updated 4 years ago
CSTR-Edinburgh / merlin
View on GitHub
This is now the official location of the Merlin project.
☆1,320Mar 3, 2020Updated 6 years ago
cmusphinx / g2p-seq2seq
View on GitHub
G2P with Tensorflow
☆680Jul 29, 2024Updated last year
YiwenShaoStephen / pychain
View on GitHub
PyTorch implementation of LF-MMI for End-to-end ASR
☆221Jan 14, 2021Updated 5 years ago
descriptinc / melgan-neurips
View on GitHub
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
☆1,040Aug 28, 2023Updated 2 years ago
AswinKumar1 / Forced-Alignment
View on GitHub
GSoC'16 RedHen Labs
☆11Aug 22, 2016Updated 9 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
alumae / kaldi-gstreamer-server
View on GitHub
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
☆1,094Jun 8, 2024Updated 2 years ago
srvk / eesen
View on GitHub
The official repository of the Eesen project
☆834May 23, 2019Updated 7 years ago
felixkreuk / SegFeat
View on GitHub
Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)
☆83Nov 13, 2021Updated 4 years ago
JeremyCCHsu / Python-Wrapper-for-World-Vocoder
View on GitHub
A Python wrapper for the high-quality vocoder "World"
☆789Jan 21, 2025Updated last year
wiseman / py-webrtcvad
View on GitHub
Python interface to the WebRTC Voice Activity Detector
☆2,491Jul 4, 2024Updated 2 years ago
wq2012 / awesome-diarization
View on GitHub
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆1,885Jul 7, 2026Updated 2 weeks ago
amirharati / kaldi-alligner
View on GitHub
scripts to align a given wave to its transcription using trained models by Kaldi
☆37Aug 15, 2019Updated 6 years ago