JarodMica / StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆30Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for StyleTTS2
- ☆52Updated 2 months ago
- A Gradio UI for XTTSv2 and RVC.☆146Updated 5 months ago
- TTS pipeline that uses RVC to enhance audio quality and cloning☆139Updated 9 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆39Updated 7 months ago
- Slightly improved official version for fine-tuning XTTS☆62Updated 2 months ago
- ☆59Updated last month
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) in Python (Gradio interface). Translated on …☆73Updated last week
- A multi-voice TTS system trained with an emphasis on quality☆24Updated last year
- ☆87Updated 6 months ago
- AI-powered speech denoising and enhancement. Adapted for Windows and optimized☆66Updated 4 months ago
- Diffusion_TTS extension for booga☆63Updated 5 months ago
- A Gradio UI for XTTSv2 and RVC.☆63Updated last month
- Using RVC via console or Python scripts☆80Updated last month
- ☆33Updated 3 months ago
- Slightly improved official version for fine-tuning XTTS☆239Updated last month
- ☆93Updated 3 months ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.☆41Updated 9 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆71Updated last month
- Misc. tools/scripts that I made to use for tortoise☆18Updated 3 months ago
- A SwarmUI extension that adds parameters for ReActor and FaceRestoreCF nodes to the generate tab☆15Updated 2 weeks ago
- Robust functionality, focused on granting convenient access to AI models developed using the Applio technology.☆14Updated 7 months ago
- 🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark☆42Updated 2 months ago
- ☆12Updated 5 months ago
- Collection of the best Applio plugins.☆19Updated 2 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆50Updated 6 months ago
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos☆45Updated 4 months ago
- ☆14Updated 5 months ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆64Updated last year
- Text-to-Music Generation with Rectified Flow Transformer☆48Updated 2 months ago
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0☆46Updated 5 months ago