JarodMica / StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆37 · Updated 6 months ago
Alternatives and similar repositories for StyleTTS2
Users who are interested in StyleTTS2 are comparing it to the repositories listed below.
- ☆72 · Updated 7 months ago
- Lightweight Gradio-based WebUI for orpheusTTS - WSL / Linux [CUDA] ☆105 · Updated 8 months ago
- A Gradio UI for XTTSv2 and RVC. ☆159 · Updated last year
- ☆100 · Updated last year
- Examples of using the llasa-tts models locally ☆181 · Updated 7 months ago
- SoTA open-source TTS for Audiobook and Podcast Generation ☆170 · Updated 5 months ago
- Unofficial WIP LoRA Finetuning repository for VibeVoice ☆256 · Updated last month
- A Gradio UI for XTTSv2 and RVC. ☆66 · Updated last year
- TTS + Voice Cloning ☆171 · Updated 3 months ago
- A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, IndexTTS-2, Chatter… ☆402 · Updated this week
- Gradio UI for YuE ☆78 · Updated 7 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation ☆38 · Updated last year
- ☆128 · Updated 8 months ago
- Collection of the best Applio plugins. ☆32 · Updated 4 months ago
- Jupyter notebooks for Inpainting | Outpainting with Flux.1 Fill dev. Able to run on Google Colab Free Tier ☆32 · Updated 11 months ago
- The official code repository for SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement ☆643 · Updated 3 weeks ago
- Slightly improved official version for fine-tuning XTTS ☆70 · Updated last year
- YuE with mp3 extend, exllama and GUI ☆62 · Updated 8 months ago
- ☆18 · Updated last year
- SoTA open-source TTS ☆117 · Updated last month
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆51 · Updated 11 months ago
- ☆71 · Updated 3 months ago
- A multi-voice TTS system trained with an emphasis on quality ☆24 · Updated 2 years ago
- ☆99 · Updated last year
- ☆261 · Updated 2 weeks ago
- Performs the entire AI cover generation process with UI ☆26 · Updated 3 months ago
- An app for creating audio-based content such as song covers and speech using Retrieval-based Voice Conversion. ☆184 · Updated last month
- OminiControl for the GPU Poor ☆39 · Updated 9 months ago
- (Windows/Linux/macOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) in Python (Gradio interface). Translated on … ☆107 · Updated last week
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆146 · Updated last year