JarodMica / StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆34 Updated 5 months ago
Alternatives and similar repositories for StyleTTS2:
Users who are interested in StyleTTS2 are comparing it to the repositories listed below.
- ☆63 Updated last month
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆56 Updated 5 months ago
- ☆95 Updated last year
- A Gradio UI for XTTSv2 and RVC. ☆158 Updated 10 months ago
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆145 Updated last year
- ☆104 Updated last month
- Using RVC via console or python scripts ☆123 Updated 6 months ago
- Misc. tools/scripts that I made to use for tortoise ☆21 Updated 8 months ago
- A Gradio UI for XTTSv2 and RVC. ☆69 Updated 7 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆77 Updated 6 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation ☆38 Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆44 Updated 4 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆158 Updated 9 months ago
- Examples of using the llasa-tts models locally ☆163 Updated last week
- Performs the entire AI cover generation process with UI ☆17 Updated this week
- ☆99 Updated 8 months ago
- Audio datasets, easier. ☆84 Updated last year
- Jupyter notebooks for Inpainting | Outpainting with Flux.1 Fill dev. Able to run on Google Colab Free Tier ☆28 Updated 4 months ago
- Collection of the best Applio plugins. ☆29 Updated 7 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion ☆174 Updated 7 months ago
- Advanced RVC Inference for quicker and effortless model downloads ☆49 Updated 2 weeks ago
- Slightly improved official version for finetune xtts ☆72 Updated 7 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA] ☆87 Updated last month
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion ☆10 Updated 7 months ago
- AI powered speech denoising and enhancement. Adapted for windows and optimized ☆82 Updated 9 months ago
- 🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark ☆61 Updated 3 weeks ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ☆35 Updated this week
- Slightly improved official version for finetune xtts ☆336 Updated 3 weeks ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on … ☆95 Updated last week
- Faster Tortoise inference than Tortoise Fast Fork ☆128 Updated last year