π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
β160Jul 15, 2024Updated last year
Alternatives and similar repositories for StyleTTS2
Users that are interested in StyleTTS2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,295Aug 10, 2024Updated last year
- β98Apr 27, 2024Updated 2 years ago
- Fine Tune the Style-TTS2 Voice Modelβ267Jun 17, 2025Updated last year
- Create Unmute voice embeddingsβ26Nov 15, 2025Updated 7 months ago
- π Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. π§π₯π Advanced audio processing.β262Jun 10, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animationβ24Jun 24, 2024Updated 2 years ago
- Application of MB-iSTFT-VITS components to vits2_pytorchβ135Dec 29, 2025Updated 6 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library πβ21May 20, 2025Updated last year
- β16Apr 23, 2024Updated 2 years ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictionsβ268Jan 13, 2025Updated last year
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorchβ517Dec 20, 2025Updated 6 months ago
- Official Implementation of StyleTTSβ464Jan 13, 2025Updated last year
- Controllable and fast Text-to-Speech for over 7000 languages!β2,202Jan 25, 2026Updated 5 months ago
- An Open Source text-to-speech system built by inverting Whisper.β4,620Dec 14, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- β14Aug 19, 2024Updated last year
- Vocoder NSF-HiFiGAN (Moved into deepaudio)β55Dec 11, 2022Updated 3 years ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.β36Sep 21, 2022Updated 3 years ago
- β12Mar 28, 2024Updated 2 years ago
- An unofficial PyTorch implementation of VALL-Eβ88Aug 3, 2025Updated 11 months ago
- Quick hack job to allow use with Sillytavern. This works for me, some further updates are expected to expose more settings to sillytavernβ11May 30, 2024Updated 2 years ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversionβ41Sep 9, 2025Updated 9 months ago
- A single Gradio + React WebUI with extensions for ACE-Step, OmniVoice, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro,β¦β3,186May 14, 2026Updated last month
- A ggml (C++) re-implementation of tortoise-ttsβ194Aug 20, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- β19Jul 11, 2024Updated last year
- β32Oct 29, 2024Updated last year
- Text to speech Plugin for Flowβ14Aug 26, 2025Updated 10 months ago
- β12Mar 18, 2024Updated 2 years ago
- ComfyUI style LDM patching in A1111β52Jun 11, 2024Updated 2 years ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for β¦β14Oct 4, 2024Updated last year
- This simple program makes use of Calibre to convert a ebook into chapters and styletts2 to turn that into a audiobook with voice cloning β¦β35Aug 30, 2024Updated last year
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical reportβ47Sep 2, 2025Updated 10 months ago
- Oobabooga extension for Bark TTSβ119Nov 23, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- This is an interface that will offline convert anything pdf document you give it into an interview between two people discussing it.β17Dec 8, 2024Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription usingβ¦β30May 27, 2023Updated 3 years ago
- This project is based on SadTalker to implement video lip synthesis.β15Jan 9, 2024Updated 2 years ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.β42Jan 26, 2024Updated 2 years ago
- β19May 2, 2024Updated 2 years ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variabilityβ108Jan 17, 2025Updated last year
- β20Jun 26, 2024Updated 2 years ago