Pip-installable package for StyleTTS 2 human-level text-to-speech and voice cloning
★162 · Jul 15, 2024 · Updated last year
Alternatives and similar repositories for StyleTTS2
Users that are interested in StyleTTS2 are comparing it to the libraries listed below.
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ★6,222 · Aug 10, 2024 · Updated last year
- ★99 · Apr 27, 2024 · Updated last year
- Fine Tune the Style-TTS2 Voice Model ★269 · Jun 17, 2025 · Updated 9 months ago
- Create Unmute voice embeddings ★24 · Nov 15, 2025 · Updated 4 months ago
- Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. Advanced audio processing. ★257 · Jun 10, 2024 · Updated last year
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation ★24 · Jun 24, 2024 · Updated last year
- Application of MB-iSTFT-VITS components to vits2_pytorch ★134 · Dec 29, 2025 · Updated 2 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library ★20 · May 20, 2025 · Updated 10 months ago
- ★16 · Apr 23, 2024 · Updated last year
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions ★268 · Jan 13, 2025 · Updated last year
- Controllable and fast Text-to-Speech for over 7000 languages! ★2,193 · Jan 25, 2026 · Updated last month
- An Open Source text-to-speech system built by inverting Whisper. ★4,576 · Dec 14, 2025 · Updated 3 months ago
- ★14 · Aug 19, 2024 · Updated last year
- Vocoder NSF-HiFiGAN (Moved into deepaudio) ★56 · Dec 11, 2022 · Updated 3 years ago
- ★11 · Mar 28, 2024 · Updated last year
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods. ★36 · Sep 21, 2022 · Updated 3 years ago
- An unofficial PyTorch implementation of VALL-E ★88 · Aug 3, 2025 · Updated 7 months ago
- Official Implementation of StyleTTS ★462 · Jan 13, 2025 · Updated last year
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and uses CUDA for … ★13 · Oct 4, 2024 · Updated last year
- A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice,… ★3,021 · Feb 19, 2026 · Updated last month
- A ggml (C++) re-implementation of tortoise-tts ★192 · Aug 20, 2024 · Updated last year
- ★19 · Jul 11, 2024 · Updated last year
- Dungeon procedural generator similar to watabou's "One Page Dungeon" ★50 · Jan 4, 2026 · Updated 2 months ago
- ★31 · Oct 29, 2024 · Updated last year
- NOVA-3D: Non-overlapped Views for 3D Anime Character Reconstruction ★27 · Mar 14, 2024 · Updated 2 years ago
- Text to speech Plugin for Flow ★13 · Aug 26, 2025 · Updated 6 months ago
- ★12 · Mar 18, 2024 · Updated 2 years ago
- ComfyUI style LDM patching in A1111 ★53 · Jun 11, 2024 · Updated last year
- This simple program makes use of Calibre to convert an ebook into chapters and styletts2 to turn that into an audiobook with voice cloning … ★35 · Aug 30, 2024 · Updated last year
- Try to replicate the architecture of MiniMaxTTS mentioned in its technical report ★48 · Sep 2, 2025 · Updated 6 months ago
- Oobabooga extension for Bark TTS ★119 · Nov 23, 2023 · Updated 2 years ago
- This is an interface that converts any PDF document you give it, offline, into an interview between two people discussing it. ★16 · Dec 8, 2024 · Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ★30 · May 27, 2023 · Updated 2 years ago
- This project is based on SadTalker to implement video lip synthesis. ★15 · Jan 9, 2024 · Updated 2 years ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui. ★42 · Jan 26, 2024 · Updated 2 years ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability ★107 · Jan 17, 2025 · Updated last year
- In this repository, you will learn how code works in VITS (Conditional Variational Autoencoder with Adversarial Learning for End-to-End Te… ★155 · Jun 5, 2023 · Updated 2 years ago
- ★20 · Jun 26, 2024 · Updated last year
- ★24 · May 22, 2024 · Updated last year