StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆37May 17, 2025Updated 10 months ago
Alternatives and similar repositories for StyleTTS2
Users that are interested in StyleTTS2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆75Mar 21, 2025Updated last year
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆10Sep 22, 2024Updated last year
- ☆42Mar 27, 2026Updated 2 weeks ago
- This project contains an NVIDIA AI Workbench project for easy installation.☆12May 30, 2024Updated last year
- Misc. tools/scripts that I made to use for tortoise☆21Aug 19, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆54Updated this week
- A simple module for making a request to the tortoise gradio page.☆17Jun 10, 2024Updated last year
- ☆101Aug 14, 2024Updated last year
- Code from Bellingcat's guide☆11Dec 8, 2022Updated 3 years ago
- Silero TTS web UI☆15Jan 30, 2024Updated 2 years ago
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…☆14Feb 7, 2025Updated last year
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)☆39Apr 6, 2026Updated last week
- Drax: Speech Recognition with Discrete Flow Matching☆75Oct 15, 2025Updated 6 months ago
- Simple Dash Effect Unity☆14Mar 18, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 将AI Studio反代成OpenAI兼容的API | OpenAI-compatible API proxy for Google AI Studio☆105Apr 1, 2026Updated 2 weeks ago
- Normalize Text in Russian☆29Nov 7, 2023Updated 2 years ago
- Performs the entire AI cover generation process with UI☆31Aug 4, 2025Updated 8 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆187Sep 27, 2024Updated last year
- process video frame by frame inside "Extras" tab☆20Sep 22, 2024Updated last year
- Dự án công cụ chuyển đổi giọng nói dành cho người Việt☆28Mar 21, 2026Updated 3 weeks ago
- A multi-voice TTS system trained with an emphasis on quality☆23Nov 6, 2023Updated 2 years ago
- TTS pipeline that uses RVC to enhance audio quality and cloning☆146Jan 25, 2024Updated 2 years ago
- Technical Cheat Sheets for Devs in Hurry☆15Feb 10, 2022Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Automatic audiovisual translation with lip-syncing☆10Dec 21, 2019Updated 6 years ago
- Custom ComfyUI node set for managing long-running, prompt-driven video projects. Includes VantageProject for project management and two s…☆43Sep 25, 2025Updated 6 months ago
- Gradio UI for training video models using finetrainers☆33Apr 18, 2025Updated 11 months ago
- [SIGGRAPH Asia'25] Enabling Reference-based Camera Control via Context without Explicit 3D Estimation☆156Jan 18, 2026Updated 2 months ago
- ☆18Apr 18, 2024Updated last year
- Official repository of Tapir Lab.'s Lip-Sync Method☆10Oct 3, 2023Updated 2 years ago
- zerodim-ffhq-x256 model in sd-webui☆20Aug 1, 2024Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆30Jun 9, 2025Updated 10 months ago
- A program that sets the stress and the letter ё of Russian text and ebooks using Wiktionary data and grammar analysis.☆39Feb 26, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Streaming ProPainter☆15Sep 18, 2024Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Sep 17, 2025Updated 6 months ago
- An agentic workflow for story book generation☆31Mar 15, 2025Updated last year
- AutoTile tileset generator for Unity☆10Jul 5, 2019Updated 6 years ago
- PostHTML plugin for code syntax highlighting with Prism.☆11Aug 30, 2024Updated last year
- Advanced 3D hack-n-slash template inspired by "Links Awakening" with an advanced maneuvering, weapons and effects systems.☆22Feb 13, 2020Updated 6 years ago
- fixed version of novitalabs cleaner; for Forge2 using Gradio4☆22May 20, 2025Updated 10 months ago