Brakanier/FastCosyVoice

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Brakanier/FastCosyVoice)

Brakanier / FastCosyVoice

Fast CosyVoice3 inference with tensorRT and tensorRT-LLM

☆77

Alternatives and similar repositories for FastCosyVoice

Users that are interested in FastCosyVoice are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mkgs210 / batch_fish_speech
View on GitHub
Boost your efficiency with Fish Speech Batch Inference. Easily process multiple texts and achieve consistently great results. 🗨️🐟
☆27Aug 4, 2025Updated 11 months ago
jingzhunxue / FlowMirror_HydraVox
View on GitHub
FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…
☆49Feb 17, 2026Updated 5 months ago
k2-fsa / Flow2GAN
View on GitHub
Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation
☆145Mar 8, 2026Updated 4 months ago
lab260ru / balalaika
View on GitHub
[INTERSPEECH 2026] Official code for "Balalaika: Data-Centric, Prosody-Aware Annotation Pipeline for Russian Speech"
☆21Updated this week
ScottishFold007 / Cosyvoice_DPO_NOTES
View on GitHub
CosyVoice_DPO_NOTES: Supercharge Your Cosyvoice model with Cutting-Edge DPO Fine-Tuning!
☆126Aug 8, 2025Updated 11 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
channel-io / ch-tts-llasa-rl-grpo
View on GitHub
☆51Apr 20, 2026Updated 3 months ago
Sharl210 / ultimate-rvc-mobile
View on GitHub
面向 Android 的本地 RVC 移动端项目(仅支持v2版本模型)，支持音频推理、悬浮窗变声器、实时推理（移动端性能受限，权限受限故仅供演示用途）；新增独有特性：降噪优化，音域过滤，噪声过滤
☆14May 15, 2026Updated 2 months ago
xingchensong / FlashCosyVoice
View on GitHub
FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.
☆250Feb 25, 2026Updated 5 months ago
ModelTC / LightTTS
View on GitHub
LightTTS is a lightweight TTS inference framework optimized for CosyVoice2 and CosyVoice3, enabling fast and scalable speech synthesis in…
☆47Apr 14, 2026Updated 3 months ago
manyeyes / KaldiNativeFbankSharp
View on GitHub
c# wrapper for kaldi-native-fbank，used to extract audio features in speech recognition (ASR) task
☆10Jul 26, 2025Updated last year
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
hi-paris / CosyVoice2-EU
View on GitHub
Europeanized CosyVoice2 for French & German
☆17Mar 30, 2026Updated 3 months ago
primepake / dac_vae
View on GitHub
Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder
☆38Aug 30, 2025Updated 10 months ago
ajd12342 / paraspeechclap
View on GitHub
Codebase for 'ParaSpeechCLAP: A Dual-Encoder Speech-Text Model for Rich Stylistic Language-Audio Pretraining'
☆23Jun 20, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
pengzhendong / streaming-ChatTTS
View on GitHub
☆23Oct 30, 2024Updated last year
the-bird-F / Expressive-Vectors
View on GitHub
[ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis
☆40Dec 24, 2025Updated 7 months ago
ryuclc / CosyVoice2-GRPO
View on GitHub
A simple implementation for improving CosyVoice2 by GRPO method
☆39May 5, 2026Updated 2 months ago
fengin / Fun-CosyVoice3-0.5B-2512-Deploy
View on GitHub
Fun-CosyVoice3-0.5B-2512 语音合成服务的简化部署方案，以及快速测试和部署提供应用调用
☆100Dec 24, 2025Updated 7 months ago
inclusionAI / MingTok-Audio
View on GitHub
☆88Feb 24, 2026Updated 5 months ago
alobashev / mkl-vc
View on GitHub
[Interspeech 2025] Official implementation of "Training-Free Voice Conversion with Factorized Optimal Transport"
☆45Sep 24, 2025Updated 10 months ago
pengzhendong / torchfa
View on GitHub
Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.
☆61Sep 5, 2025Updated 10 months ago
thu-spmi / CTC-TTS
View on GitHub
Code for CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment, Interspeech 2026.
☆20Jun 9, 2026Updated last month
JudeJiwoo / nmt
View on GitHub
☆15Apr 13, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
kandinskylab / kvae-audio
View on GitHub
KVAE-Audio: a continuous full-band audio waveform autoencoder
☆101Updated this week
Berkeley-Speech-Group / StyleStream
View on GitHub
☆60Jun 11, 2026Updated last month
andimarafioti / nano-parakeet
View on GitHub
Pure-PyTorch Parakeet TDT inference
☆51Mar 10, 2026Updated 4 months ago
liuhuang31 / HiFTNet-sr
View on GitHub
HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz
☆24Jan 2, 2024Updated 2 years ago
yu-haoyuan / fd-badcat
View on GitHub
fd-sds
☆20Apr 8, 2026Updated 3 months ago
meaningTeam / tidy-tunes
View on GitHub
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆23May 19, 2026Updated 2 months ago
yynil / RWKVTTS
View on GitHub
This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).
☆101Oct 8, 2025Updated 9 months ago
ydqmkkx / ShallowFlowMatching-TTS
View on GitHub
Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis
☆55Sep 20, 2025Updated 10 months ago
AmphionTeam / SpeechJudge
View on GitHub
SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)
☆78Dec 23, 2025Updated 7 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
IS2AI / KazEmoTTS
View on GitHub
An open-source Kazakh Emotional Text-to-Speech Dataset
☆36Aug 1, 2025Updated 11 months ago
wenet-e2e / west
View on GitHub
We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction
☆206Jul 17, 2026Updated last week
zaigie / FunSpeech
View on GitHub
开箱即用的本地私有化部署语音服务，快速搭建Qwen3ASR/FunASR与Qwen3TTS/CosyVoice后端
☆153Jul 6, 2026Updated 2 weeks ago
ryota-komatsu / speech_resynth
View on GitHub
Speech Resynthesis and Language Modeling
☆27Jun 11, 2025Updated last year
Harry-Yu-Shuhang / Step-Audio-tts
View on GitHub
☆11Feb 20, 2025Updated last year
ASLP-lab / Hum-Dial
View on GitHub
ICASSP2026 HumDial Challenge
☆50May 28, 2026Updated last month
ASLP-lab / VoiceSculptor
View on GitHub
An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.
☆250Feb 26, 2026Updated 4 months ago