JarodMica / StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆30 Updated this week
Related projects
Alternatives and complementary repositories for StyleTTS2
- ☆51 Updated last month
- A Gradio UI for XTTSv2 and RVC. ☆142 Updated 5 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation ☆39 Updated 7 months ago
- Diffusion_TTS extension for booga ☆63 Updated 4 months ago
- ☆86 Updated 6 months ago
- ☆93 Updated 2 months ago
- (Windows/Linux/macOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) in Python (Gradio interface). Translated on … ☆71 Updated this week
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0 ☆44 Updated 4 months ago
- ☆59 Updated 2 weeks ago
- ☆38 Updated 5 months ago
- One-click launcher for Audiocraft MusicGen + AudioGen Gradio Web UI ☆63 Updated last year
- Text to image with Stable Cascade (Gradio interface); requires less VRAM than the original official example on Hugging Face ☆40 Updated 5 months ago
- Slightly improved official version for fine-tuning XTTS ☆229 Updated 2 weeks ago
- ☆30 Updated last month
- This Flux latent upscaler workflow creates a lower-resolution initial pass, then advances to a second pass that upscales in latent space … ☆82 Updated 2 months ago
- Chat with your RVC models. See website for demo: ☆20 Updated 8 months ago
- An image viewer and AI-assisted editing tool that helps with curating datasets for generative AI models, finetunes and LoRA. ☆73 Updated this week
- Comfy UI in WhatsApp. ☆49 Updated last month
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆139 Updated 9 months ago
- AI-powered speech denoising and enhancement. Adapted for Windows and optimized ☆61 Updated 3 months ago
- ☆31 Updated 2 months ago
- Zero-Shot Speech Editing and Text-to-Speech in the Wild ☆50 Updated 6 months ago
- ☆26 Updated 4 months ago
- Slightly improved official version for fine-tuning XTTS ☆60 Updated last month
- Text-to-Music Generation with Rectified Flow Transformer ☆45 Updated 2 months ago
- A Gradio UI for XTTSv2 and RVC. ☆61 Updated last month
- Transcribe audio and add subtitles to videos using Whisper in ComfyUI ☆74 Updated 3 months ago
- All workflow packs for ComfyUI from @driftjohnson @fivebelowfiveuk ☆14 Updated this week
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech ☆264 Updated 9 months ago
- ☆14 Updated 4 months ago