sidharthrajaram/StyleTTS2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sidharthrajaram/StyleTTS2)

sidharthrajaram / StyleTTS2

🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning

☆159

Alternatives and similar repositories for StyleTTS2

Users that are interested in StyleTTS2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yl4579 / StyleTTS2
View on GitHub
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆6,316Aug 10, 2024Updated last year
NeuralVox / StyleTTS2
View on GitHub
☆98Apr 27, 2024Updated 2 years ago
IIEleven11 / StyleTTS2FineTune
View on GitHub
Fine Tune the Style-TTS2 Voice Model
☆267Jun 17, 2025Updated last year
EZ-VC / EZ-VC
View on GitHub
[EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion
☆43Sep 9, 2025Updated 10 months ago
davidmartinrius / speech-dataset-generator
View on GitHub
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
☆262Jun 10, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
FENRlR / MB-iSTFT-VITS2
View on GitHub
Application of MB-iSTFT-VITS components to vits2_pytorch
☆135Dec 29, 2025Updated 6 months ago
davidbrowne17 / Mimi-Voice
View on GitHub
Create Unmute voice embeddings
☆26Nov 15, 2025Updated 8 months ago
idiap / coqui-ai-Trainer
View on GitHub
🐸 - A general purpose model trainer, as flexible as it gets
☆16Apr 10, 2026Updated 3 months ago
yl4579 / PL-BERT
View on GitHub
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
☆270Jan 13, 2025Updated last year
lucidrains / e2-tts-pytorch
View on GitHub
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
☆516Dec 20, 2025Updated 7 months ago
yl4579 / StyleTTS
View on GitHub
Official Implementation of StyleTTS
☆466Jan 13, 2025Updated last year
DigitalPhonetics / IMS-Toucan
View on GitHub
Controllable and fast Text-to-Speech for over 7000 languages!
☆2,207Jan 25, 2026Updated 5 months ago
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
WhisperSpeech / WhisperSpeech
View on GitHub
An Open Source text-to-speech system built by inverting Whisper.
☆4,624Dec 14, 2025Updated 7 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
vtuber-plan / NSF-HiFiGAN
View on GitHub
Vocoder NSF-HiFiGAN (Moved into deepaudio)
☆56Dec 11, 2022Updated 3 years ago
longtimegone / StyleTTS2-Sillytavern-api
View on GitHub
Quick hack job to allow use with Sillytavern. This works for me, some further updates are expected to expose more settings to sillytavern
☆11May 30, 2024Updated 2 years ago
e-c-k-e-r / vall-e
View on GitHub
An unofficial PyTorch implementation of VALL-E
☆88Aug 3, 2025Updated 11 months ago
hcy71o / SC-VITS
View on GitHub
VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.
☆36Sep 21, 2022Updated 3 years ago
jaechanjo / TIFF
View on GitHub
Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation
☆24Jun 24, 2024Updated 2 years ago
Vyvo-Labs / CodecHub
View on GitHub
CodecHub: A Unified Library for Codec Models
☆25Dec 24, 2025Updated 7 months ago
thepowerfuldeez / rvc-trainer
View on GitHub
☆12Mar 28, 2024Updated 2 years ago
rsxdalv / TTS-WebUI
View on GitHub
A single Gradio + React WebUI with extensions for ACE-Step, OmniVoice, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro,…
☆3,215Jul 6, 2026Updated 2 weeks ago
asmpro7 / AudFlow
View on GitHub
Text to speech Plugin for Flow
☆14Aug 26, 2025Updated 10 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
haoweilou / ParaStyleTTS
View on GitHub
This is the official code for ACM CIKM 2025 Paper: ParaStyleTTS: Toward Efficient and Robust Paralinguistic Style Control for Expressive …
☆59Dec 21, 2025Updated 7 months ago
camenduru / Open-Sora-jupyter
View on GitHub
☆12Mar 18, 2024Updated 2 years ago
mbzuai-nlp / sttatts
View on GitHub
☆31Oct 29, 2024Updated last year
Haurrus / xtts-trainer-no-ui-auto
View on GitHub
This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …
☆14Oct 4, 2024Updated last year
hannahun1 / anodes
View on GitHub
☆16Apr 23, 2024Updated 2 years ago
balisujohn / tortoise.cpp
View on GitHub
A ggml (C++) re-implementation of tortoise-tts
☆194Aug 20, 2024Updated last year
rishikksh20 / MiniMax-TTS-pytorch
View on GitHub
Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report
☆47Sep 2, 2025Updated 10 months ago
BuffMcBigHuge / text-generation-webui-edge-tts
View on GitHub
A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.
☆42Jan 26, 2024Updated 2 years ago
gauravk95 / SadTalker-Video
View on GitHub
This project is based on SadTalker to implement video lip synthesis.
☆14Jan 9, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
wsippel / bark_tts
View on GitHub
Oobabooga extension for Bark TTS
☆117Nov 23, 2023Updated 2 years ago
winddori2002 / DEX-TTS
View on GitHub
DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability
☆108Jan 17, 2025Updated last year
Ereboas / TacoLM
View on GitHub
☆19May 2, 2024Updated 2 years ago
hcy71o / MB-iSTFT-VITS-with-AutoVocoder
View on GitHub
Incorporating AutoVocoder to MB-iSTFT-VITS
☆47Dec 1, 2022Updated 3 years ago
taresh18 / orpheus-streaming
View on GitHub
Orpheus TTS Server with streaming support (TTFB ~160ms)
☆26Sep 21, 2025Updated 10 months ago
erew123 / alltalk_tts
View on GitHub
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…
☆2,418Jan 9, 2026Updated 6 months ago
shivammehta25 / OverFlow
View on GitHub
Putting flows on top of neural transducers for better TTS
☆64Jul 13, 2026Updated last week