ttsds / ttsds_systems

Recipes to create the synthetic data for the benchmarked TTS systems.

☆27

Alternatives and similar repositories for ttsds_systems

Users who are interested in ttsds_systems are comparing it to the repositories listed below.

clement-pages / gryannote
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
☆62 Updated 3 weeks ago
taresh18 / TTSizer
🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨
☆90 Updated last month
stlohrey / dia-finetuning
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆104 Updated last month
davidbrowne17 / chatterbox-streaming
Streaming and Fine-tuning for Chatterbox TTS
☆109 Updated last week
tonychenxyz / emoknob
This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…
☆74 Updated 8 months ago
kyutai-labs / moshi-finetune
☆238 Updated 2 months ago
NeuralVox / OpenPhonemizer
An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…
☆99 Updated 8 months ago
thomasgauthier / csm-hf
Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers
☆56 Updated last month
stlohrey / chatterbox-finetuning
SoTA open-source TTS
☆46 Updated 2 weeks ago
yl4579 / StyleTTS-ZS
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion
☆180 Updated 9 months ago
IIEleven11 / Automatic-Audio-Dataset-Maker
Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.
☆39 Updated last week
dangtr0408 / StyleTTS2-lite
A lightweight, efficient variation of the StyleTTS 2 text-to-speech model.
☆24 Updated last month
zhenye234 / X-Codec-2.0
Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
☆280 Updated last week
knoriy / CLARA
☆62 Updated 11 months ago
JarodMica / tortoise_dataset_tools
Misc. tools/scripts that I made to use for tortoise
☆21 Updated 10 months ago
skirdey / voicerestore
VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration
☆174 Updated 2 months ago
jakariaemon / WSI
Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.
☆18 Updated 3 months ago
xinliu9451 / awesome-denoiser
This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …
☆40 Updated 6 months ago
JosefAlbers / e2tts-mlx
Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX
☆27 Updated 8 months ago
e-c-k-e-r / vall-e
An unofficial PyTorch implementation of VALL-E
☆87 Updated 3 weeks ago
fakerybakery / simpletts
A lightweight Python library for running TTS models with a unified API.
☆20 Updated 4 months ago
EndlessReform / smoltts
Open TTS models, built for streaming on the edge
☆43 Updated 3 months ago
RobViren / kvoicewalk
A random walk voice style cloning application for Kokoro text to speech
☆99 Updated last week
ex3ndr / supervoice-voicebox
VoiceBox neural network implementation
☆109 Updated 10 months ago
5Hyeons / StyleTTS2-Vocos
StyleTTS2 + Vocos as a Decoder
☆12 Updated 3 months ago
yangdongchao / RSTnet
Real-time Speech-Text Foundation Model Toolkit (wip)
☆237 Updated 3 months ago
rioharper / VocalForge
Your one-stop solution for voice dataset creation
☆119 Updated last year
zhenye234 / LLaSA_inference
☆40 Updated 4 months ago
huggingface / dataspeech
☆365 Updated 9 months ago
anan235 / dia-multilingual
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆179 Updated 2 months ago