HuiResearch/FlashTTS


HuiResearch / FlashTTS

Provides high-quality Chinese speech synthesis and voice cloning services based on models such as SparkTTS and OrpheusTTS.

☆612

Alternatives and similar repositories for FlashTTS

Users who are interested in FlashTTS are comparing it to the libraries listed below.


SparkAudio / Spark-TTS
Spark-TTS Inference Code
☆11,000 · Apr 9, 2025 · Updated last year
qi-hua / async_cosyvoice
Accelerates CosyVoice2 inference with vLLM
☆497 · Apr 26, 2025 · Updated last year
Ksuriuri / index-tts-vllm
Added vLLM support to IndexTTS for faster inference.
☆1,203 · Apr 13, 2026 · Updated 3 months ago
bytedance / MegaTTS3
☆6,082 · Jun 15, 2026 · Updated last month
tuanh123789 / Spark-TTS-finetune
Fine-tune the LLM part of the Spark-TTS model
☆125 · Mar 25, 2025 · Updated last year
SparkAudio / VoxBox
A large-scale speech corpus introduced in Spark-TTS, built from diverse open-source datasets for training text-to-speech (TTS) systems.
☆115 · May 5, 2025 · Updated last year
FunAudioLLM / CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,292 · May 25, 2026 · Updated last month
Lourdle / CosyVoiceForOnnx
☆15 · Dec 22, 2025 · Updated 6 months ago
easygoingbl / auditlimit
Content moderation and rate limiting service
☆26 · May 18, 2025 · Updated last year
FireRedTeam / FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System
☆908 · Sep 28, 2025 · Updated 9 months ago
MYZY-AI / Muyan-TTS
☆480 · May 19, 2025 · Updated last year
k2-fsa / ZipVoice
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,015 · Dec 2, 2025 · Updated 7 months ago
SparkAudio / SparkVox
☆37 · Jun 9, 2025 · Updated last year
canopyai / Orpheus-TTS
Towards Human-Sounding Speech
☆6,247 · Dec 5, 2025 · Updated 7 months ago
yynil / RWKVTTS
This project trains an RWKV LLM for TTS generation that is compatible with other TTS engines (like fish/cosy/chattts).
☆101 · Oct 8, 2025 · Updated 9 months ago
SWivid / F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆14,981 · Jul 5, 2026 · Updated 2 weeks ago
index-tts / index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
☆22,015 · Updated this week
miemiekurisu / qwen3asr_cpu
A high-performance C/C++ inference server for Qwen3-ASR, optimized for CPU/GPU real-time streaming speech recognition.
☆15 · Jun 27, 2026 · Updated 3 weeks ago
Ilikepizza2 / spark-tts-server
(MacOS Support) OpenAI compatible http server for Spark-TTS
☆15 · May 1, 2025 · Updated last year
VITA-MLLM / VITA-Audio
✨✨[NeurIPS 2025] VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model
☆682 · May 24, 2025 · Updated last year
Downupanddownup / RefAudioSelectorV2-BaseOn-GptSoVits
Reference audio selection tool based on the GptSoVits project
☆26 · Aug 17, 2025 · Updated 11 months ago
k2-fsa / Flow2GAN
Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation
☆144 · Mar 8, 2026 · Updated 4 months ago
yrom / finetune-index-tts
IndexTTS Fine-tuning notebooks
☆138 · Jun 17, 2025 · Updated last year
FireRedTeam / FireRedASR
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…
☆1,937 · Feb 25, 2026 · Updated 4 months ago
shengcanxu / canoSpeech
Text to speech
☆10 · Mar 19, 2024 · Updated 2 years ago
FireRedTeam / FireRedTTS2
Long-form streaming TTS system for multi-speaker dialogue generation
☆1,412 · Oct 26, 2025 · Updated 8 months ago
neosun100 / indextts2-docker
Production-ready Docker images for IndexTTS2 - Zero-shot text-to-speech with emotion control
☆16 · Dec 7, 2025 · Updated 7 months ago
diudiu62 / CosyVoice-api
☆33 · Feb 28, 2025 · Updated last year
IDEA-Emdoor-Lab / DistilCodec
A Neural Audio Codec (NAC) for Universal Audio
☆46 · May 30, 2025 · Updated last year
GiantAILab / DiaMoE-TTS
Official code for "DiaMoE-TTS: A Unified IPA-based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptat…
☆246 · Nov 28, 2025 · Updated 7 months ago
catcto / CosyVoiceDocker
This repository provides a Docker image for CosyVoice
☆27 · Dec 22, 2024 · Updated last year
FunAudioLLM / SenseVoice
Open-source SenseVoiceSmall model for Mandarin, Cantonese, English, Japanese, and Korean ASR, language ID, emotion recognition, and audio…
☆8,902 · Updated this week
adelacvg / ttts
Train the next generation of TTS systems.
☆169 · Sep 13, 2024 · Updated last year
XiaomiMiMo / MiMo-Audio
MiMo-Audio: Audio Language Models are Few-Shot Learners
☆1,064 · Jun 17, 2026 · Updated last month
Plachtaa / seed-vc
Zero-shot voice conversion & singing voice conversion, with real-time support
☆3,878 · Apr 20, 2025 · Updated last year
IDEA-Emdoor-Lab / UniTTS
A TTS Trained on Universal Audio.
☆41 · Jun 6, 2025 · Updated last year
myshell-ai / MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.
☆7,544 · Dec 24, 2024 · Updated last year
DataoceanAI / Dolphin
Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.
☆772 · Jun 11, 2026 · Updated last month
wenet-e2e / WeTextProcessing
Text Normalization & Inverse Text Normalization
☆800 · Jun 26, 2026 · Updated 3 weeks ago