JarodMica / StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆37 Updated 8 months ago
Alternatives and similar repositories for StyleTTS2
Users who are interested in StyleTTS2 are comparing it to the repositories listed below.
- ☆73 Updated 9 months ago
- Examples of using the llasa-tts models locally ☆182 Updated 9 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA] ☆105 Updated 2 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆53 Updated last year
- ☆100 Updated last year
- A multi-voice TTS system trained with an emphasis on quality ☆23 Updated 2 years ago
- Gradio UI for YuE ☆88 Updated 9 months ago
- A Gradio UI for XTTSv2 and RVC. ☆161 Updated last year
- A Gradio UI for XTTSv2 and RVC. ☆66 Updated last year
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation ☆38 Updated last year
- AI powered speech denoising and enhancement. Adapted for Windows and optimized ☆92 Updated last year
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆146 Updated last year
- Slightly improved official version for finetune xtts ☆71 Updated last year
- ☆100 Updated last year
- Unofficial WIP LoRA Finetuning repository for VibeVoice ☆326 Updated 3 months ago
- ☆135 Updated 10 months ago
- ToonOut, a fork of BiRefNet focused on background removal for anime images. We open-source our dataset & our weights. See our paper at: h… ☆76 Updated 4 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆161 Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆66 Updated last year
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on … ☆107 Updated last week
- Jupyter notebooks for Inpainting | Outpainting with Flux.1 Fill dev. Able to run on Google Colab Free Tier ☆33 Updated last year
- Using RVC via console or python scripts ☆140 Updated last year
- ☆72 Updated 5 months ago
- TTS + Voice Cloning ☆200 Updated 2 weeks ago
- SoTA open-source TTS for Audiobook and Podcast Generation ☆182 Updated 7 months ago
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation ☆207 Updated last year
- YuE with mp3 extend, exllama and GUI ☆64 Updated 10 months ago
- OminiControl for the GPU Poor ☆39 Updated 11 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ☆30 Updated 7 months ago
- Slightly improved official version for finetune xtts ☆380 Updated 9 months ago