JarodMica / StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆36 Updated 3 weeks ago
Alternatives and similar repositories for StyleTTS2
Users who are interested in StyleTTS2 are comparing it to the repositories listed below.
- ☆67 Updated 2 months ago
- ☆96 Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆62 Updated 6 months ago
- Examples of using the llasa-tts models locally ☆172 Updated last month
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆144 Updated last year
- ☆99 Updated 9 months ago
- Full GUI Version ☆31 Updated 2 years ago
- Using RVC via console or python scripts ☆126 Updated 7 months ago
- Misc. tools/scripts that I made to use for tortoise ☆21 Updated 9 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA] ☆97 Updated 2 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on … ☆96 Updated 3 weeks ago
- AI powered speech denoising and enhancement. Adapted for windows and optimized ☆88 Updated 10 months ago
- A multi-voice TTS system trained with an emphasis on quality ☆24 Updated last year
- A Gradio UI for XTTSv2 and RVC. ☆157 Updated last year
- A Gradio UI for XTTSv2 and RVC. ☆68 Updated 8 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆79 Updated 7 months ago
- ☆112 Updated 2 months ago
- A random walk voice style cloning application for Kokoro text to speech ☆85 Updated last week
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s… ☆52 Updated last year
- Collection of the best Applio plugins. ☆29 Updated 8 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation ☆37 Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆48 Updated 5 months ago
- Audio datasets, easier. ☆84 Updated last year
- Gradio UI for YuE ☆56 Updated 2 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ☆36 Updated last week
- SoTA open-source TTS for Audiobook and Podcast Generation ☆17 Updated this week
- Performs the entire AI cover generation process with UI ☆18 Updated 3 weeks ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion ☆179 Updated 8 months ago
- YuE: Open Full-song Generation Foundation Model, something similar to Suno.ai but open ☆64 Updated last month
- ☆68 Updated 7 months ago