NeuralVox/StyleTTS2

☆98

Alternatives and similar repositories for StyleTTS2

Users who are interested in StyleTTS2 are comparing it to the repositories listed below.

IIEleven11 / StyleTTS2FineTune
Fine Tune the Style-TTS2 Voice Model
☆267 • Jun 17, 2025 • Updated last year
liuhuang31 / g2pw_once
G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…
☆14 • Dec 30, 2023 • Updated 2 years ago
duerig / StyleTTS2
StyleTTS 2 Optimized Training Fork
☆32 • Feb 2, 2025 • Updated last year
shivammehta25 / BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
☆29 • Jun 28, 2024 • Updated 2 years ago
lars76 / fastspeech2-clean
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆18 • Aug 16, 2024 • Updated last year
reppy4620 / convnext_tts
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18 • Oct 20, 2024 • Updated last year
sidharthrajaram / StyleTTS2
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
☆159 • Jul 15, 2024 • Updated 2 years ago
AndrewVeee / ai_tools
Small tools to enhance your AI app with little effort.
☆12 • Jan 9, 2024 • Updated 2 years ago
yl4579 / StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆6,316 • Aug 10, 2024 • Updated last year
mush42 / istft-onnx
Export an ONNX graph that performs ISTFT. Designed for TTS models.
☆28 • Apr 23, 2024 • Updated 2 years ago
themanyone / caption_anything
Caption, translate, and optionally record in real time "what you hear" from speakers and microphone. Never miss part of the conversation …
☆24 • Sep 11, 2025 • Updated 10 months ago
dicksondickson / ComfyUI-Dickson-Nodes
A set of custom nodes that I've either written myself or adapted from other authors for my own convenience.
☆11 • Sep 18, 2024 • Updated last year
Stylish-TTS / stylish-tts
High quality text-to-speech based on StyleTTS 2.
☆78 • Apr 6, 2026 • Updated 3 months ago
liuhuang31 / HiFTNet-sr
HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz
☆24 • Jan 2, 2024 • Updated 2 years ago
rishikksh20 / MiniMax-TTS-pytorch
An attempt to replicate the architecture of MiniMaxTTS described in its technical report
☆47 • Sep 2, 2025 • Updated 10 months ago
merlresearch / reverberation-as-supervision
Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation
☆15 • Aug 1, 2024 • Updated last year
freds0 / CML-TTS-Dataset
CML-TTS: A Multilingual Dataset for Speech Synthesis
☆35 • Jul 31, 2024 • Updated last year
gauravk95 / SadTalker-Video
This project is based on SadTalker to implement video lip synthesis.
☆14 • Jan 9, 2024 • Updated 2 years ago
ShoukanLabs / VoPho
A collection of all our phonemizers for dataset construction and inference
☆30 • Feb 21, 2025 • Updated last year
tonnetonne814 / unofficial-vits2-44100-Ja
An unofficial VITS2 TTS implementation in PyTorch, adapted for 44100 Hz Japanese audio.
☆24 • Sep 1, 2023 • Updated 2 years ago
ZehuaKcrissLi / GTR-Voice
☆16 • Nov 11, 2024 • Updated last year
ValyrianTech / OpenVoice_server
API server for Instant voice cloning by MyShell.
☆107 • Sep 26, 2024 • Updated last year
davidmartinrius / speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
☆262 • Jun 10, 2024 • Updated 2 years ago
anton-kashkin / hifi_vc
☆25 • Jan 24, 2023 • Updated 3 years ago
ml-for-speech / speechtoolkit
[Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…
☆22 • Jan 10, 2025 • Updated last year
yihuitang / StyleTTS_Mandarin
Implementation of StyleTTS for Mandarin
☆11 • Jun 22, 2023 • Updated 3 years ago
hyperfocAIs / Attend
Attend - to what matters.
☆17 • Feb 22, 2025 • Updated last year
shengcanxu / canoSpeech
Text to speech
☆10 • Mar 19, 2024 • Updated 2 years ago
yl4579 / StyleTTS
Official Implementation of StyleTTS
☆466 • Jan 13, 2025 • Updated last year
Cybonto / OllaDeck
OllaDeck is a purple technology stack for Generative AI (text modality) cybersecurity. It provides a comprehensive set of tools for both …
☆17 • Sep 21, 2024 • Updated last year
erew123 / alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, but supports a variety of adv…
☆2,418 • Jan 9, 2026 • Updated 6 months ago
kohei0209 / self-remixing
Official implementation of Self-Remixing
☆18 • Feb 3, 2024 • Updated 2 years ago
l33tkr3w / LlamaCards
LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to …
☆36 • Aug 28, 2024 • Updated last year
JarodMica / rvc
Installable package for RVC voice inference
☆10 • Aug 11, 2024 • Updated last year
yoongi43 / VRVQ
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11 • Apr 10, 2025 • Updated last year
gitmylo / FlowNodes
Flow control nodes for ComfyUI, allowing for more diverse workflows
☆13 • Apr 3, 2025 • Updated last year
naver-ai / RapFlow-TTS
☆56 • Jul 16, 2025 • Updated last year
flamed-tts / Flamed-TTS
This repository implements a novel zero-shot TTS framework, named Flamed-TTS, focusing on efficient generation and dynamic pacing in …
☆57 • Aug 9, 2025 • Updated 11 months ago
nivibilla / local-llasa-tts
Examples of using the llasa-tts models locally
☆178 • Apr 20, 2025 • Updated last year