lowerquality / gentle

gentle forced aligner

☆1,491

Alternatives and similar repositories for gentle:

Users that are interested in gentle are comparing it to the libraries listed below

pettarin / forced-alignment-tools
A collection of links and notes on forced alignment tools
☆883Updated 3 years ago
MontrealCorpusTools / Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
☆1,388Updated last month
readbeyond / aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
☆2,552Updated 6 months ago
bootphon / phonemizer
Simple text to phones converter for multiple languages
☆1,279Updated 3 months ago
Kyubyong / g2p
g2p: English Grapheme To Phoneme Conversion
☆831Updated 2 years ago
prosodylab / Prosodylab-Aligner
Python interface for forced audio alignment using HTK and SoX
☆334Updated 4 years ago
coqui-ai / open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
☆1,298Updated 7 months ago
wiseman / py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
☆2,120Updated 6 months ago
xinjli / allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
☆585Updated 8 months ago
amsehili / auditok
An audio/acoustic activity detection and audio segmentation tool
☆759Updated last month
coqui-ai / TTS-papers
🐸 collection of TTS papers
☆660Updated 6 months ago
Rayhane-mamah / Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
☆2,289Updated last year
MycroftAI / mimic-recording-studio
Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice …
☆502Updated last year
linto-ai / whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
☆2,178Updated last month
alumae / kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
☆1,077Updated 7 months ago
ina-foss / inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender …
☆774Updated last week
gooofy / zamia-speech
Open tools and data for cloudless automatic speech recognition
☆446Updated 3 years ago
keithito / tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
☆2,965Updated last year
Edresson / YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
☆935Updated 2 months ago
Kyubyong / tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
☆1,827Updated 3 years ago
mozilla / DSAlign
DeepSpeech based forced alignment tool
☆235Updated 4 years ago
AdolfVonKleist / Phonetisaurus
Phonetisaurus G2P
☆457Updated 7 months ago
Tomiinek / Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
☆833Updated last year
r9y9 / wavenet_vocoder
WaveNet vocoder
☆2,337Updated last year
cmusphinx / g2p-seq2seq
G2P with Tensorflow
☆669Updated 5 months ago
pykaldi / pykaldi
A Python wrapper for Kaldi
☆1,006Updated 5 months ago
wq2012 / awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆1,670Updated 3 months ago
google / voice-builder
An opensource text-to-speech (TTS) voice building tool
☆662Updated 5 months ago
Shahabks / myprosody
A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.
☆240Updated 2 years ago
coqui-ai / STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
☆2,309Updated 10 months ago