just-ai / speechflowLinks

☆29

Alternatives and similar repositories for speechflow

Users that are interested in speechflow are comparing it to the libraries listed below

Sorting:

chomeyama / wavehax
Official repository of Wavehax vocoder
☆53Updated last month
manmay-nakhashi / TTSizer
🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨
☆17Updated 3 months ago
wetdog / wavenext_pytorch
Unofficial implementation of wavenext vocoder
☆49Updated last year
shigabeev / russian_tts_normalization
Normalize Text in Russian
☆27Updated last year
choiHkk / Transformer-TTS-V2
☆25Updated last year
egorsmkv / asr-corpus-creator
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Updated last year
IDRnD / VoxTube
The VoxTube dataset official repository
☆70Updated last year
fakerybakery / utmos
A toolkit to calculate speech audio quality. Not affiliated with the original authors
☆59Updated last year
deepvk / emospeech
☆126Updated last year
Edresson / ZS-TTS-Evaluation
☆43Updated 11 months ago
iisys-hof / HUI-Audio-Corpus-German
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…
☆32Updated 2 years ago
utter-project / mHuBERT-147-scripts
Collection of scripts from mHuBERT-147.
☆29Updated 9 months ago
NeuralVox / OpenPhonemizer
An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…
☆103Updated 11 months ago
Aria-K-Alethia / laughter-synthesis
Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…
☆76Updated 2 years ago
deepvk / muse
🎵 muse: Music Separation
☆11Updated last year
ruslan-corpus / ruslan-corpus.github.io
☆21Updated 6 years ago
maxrmorrison / torbi
Viterbi decoding in PyTorch
☆37Updated last week
jzmzhong / Automatic-Prosody-Annotator-with-SSWP-CLAP
An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).
☆50Updated last year
NVIDIA / RAD-MMM
A TTS model that makes a speaker speak new languages
☆76Updated last year
spring-media / DeepForcedAligner
☆80Updated last month
seastar105 / pflow-encodec
Implementation of TTS model based on NVIDIA P-Flow TTS Paper
☆76Updated last year
AlanBaade / SyllableLM
Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models
☆58Updated 2 months ago
RF5 / transfusion-asr
Transcribing Speech with Multinomial Diffusion, training code and models.
☆79Updated last year
Takaaki-Saeki / zm-text-tts
[IJCAI'23] Learning to Speak from Text for Low-Resource TTS
☆63Updated 2 years ago
bfs18 / e2_tts
☆70Updated last year
frankyoujian / Edge-Punct-Casing
☆29Updated 7 months ago
tomaarsen / TTSTextNormalization
Convert English text from written expressions into spoken forms
☆26Updated 3 years ago
Tobertz-max / DiFlow-TTS
DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast…
☆37Updated last week
bshall / urhythmic
Unsupervised Rhythm Modeling for Voice Conversion
☆84Updated 2 years ago
e-c-k-e-r / vall-e
An unofficial PyTorch implementation of VALL-E
☆88Updated last month