JarodMica / StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆37 Updated 2 months ago
Alternatives and similar repositories for StyleTTS2
Users who are interested in StyleTTS2 are comparing it to the repositories listed below.
- ☆67 Updated 4 months ago
- Examples of using the llasa-tts models locally ☆177 Updated 3 months ago
- SoTA open-source TTS for Audiobook and Podcast Generation ☆141 Updated last month
- A Gradio UI for XTTSv2 and RVC. ☆160 Updated last year
- ☆101 Updated 11 months ago
- ☆98 Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆50 Updated 7 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA] ☆101 Updated 4 months ago
- AI powered speech denoising and enhancement. Adapted for windows and optimized ☆89 Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆80 Updated 9 months ago
- Higgs Audio v2 WebUI + One click installer WIN x64 ☆13 Updated 2 weeks ago
- ☆117 Updated 4 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation ☆38 Updated last year
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆145 Updated last year
- Gradio UI for YuE ☆68 Updated 4 months ago
- Performs the entire AI cover generation process with UI ☆22 Updated 3 weeks ago
- A Gradio UI for XTTSv2 and RVC. ☆67 Updated 10 months ago
- TTS + Voice Cloning ☆147 Updated 2 weeks ago
- Using RVC via console or python scripts ☆129 Updated 9 months ago
- Slightly improved official version for finetune xtts ☆72 Updated 10 months ago
- Audio datasets, easier. ☆84 Updated last year
- ☆17 Updated last year
- A collection of compiled wheels for deepspeed built for python 3.10 and 3.11 with support for cuda 11.8 and 12.1 for Windows ☆69 Updated 10 months ago
- Slightly improved official version for finetune xtts ☆364 Updated 4 months ago
- OminiControl for the GPU Poor ☆36 Updated 6 months ago
- 🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark ☆66 Updated last month
- YuE with mp3 extend, exllama and GUI ☆58 Updated 5 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆161 Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆62 Updated 8 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ☆28 Updated 2 months ago