shenduldh/CosyVoice-Lightning

shenduldh / CosyVoice-Lightning

Lightning-responsive CosyVoice streaming API based on FastAPI.

☆28

Alternatives and similar repositories for CosyVoice-Lightning

Users who are interested in CosyVoice-Lightning are comparing it to the libraries listed below.

qi-hua / async_cosyvoice
Accelerates CosyVoice2 inference with vLLM
☆498 · Apr 26, 2025 · Updated last year
kodeleung / CosyVoice2
A modification of the official CosyVoice, with the overall interaction adapted to the CosyVoice2 model; works out of the box
☆23 · Jun 15, 2025 · Updated last year
yoongi43 / VRVQ
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11 · Apr 10, 2025 · Updated last year
zhu-han / SpeechLLM
LLM-based ASR recipe with Zipformer encoder and Qwen LLM
☆34 · Sep 25, 2025 · Updated 9 months ago
primepake / dac_vae
Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder
☆38 · Aug 30, 2025 · Updated 10 months ago
dmisol / flexatar-virtual-webcam
Personalized Virtual Webcam for WebRTC
☆19 · Apr 20, 2026 · Updated 3 months ago
iamanigeeit / present
☆14 · Aug 19, 2024 · Updated last year
dadDR / rkllm_talking
rkllm_talking is a standalone compiled voice communication system based on a large model
☆15 · Oct 13, 2024 · Updated last year
xingchensong / FlashCosyVoice
FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.
☆250 · Feb 25, 2026 · Updated 4 months ago
Mddct / usm-tokenizer
semantic tokenizer for speech and music
☆20 · Jul 6, 2025 · Updated last year
ydqmkkx / ShallowFlowMatching-TTS
Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis
☆55 · Sep 20, 2025 · Updated 10 months ago
colaudiolab / AudioSet-R
Official implementation: "AudioSet-R: A Refined AudioSet with Multi-Stage LLM Label Reannotation"
☆19 · Oct 9, 2025 · Updated 9 months ago
meaningTeam / tidy-tunes
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆23 · May 19, 2026 · Updated 2 months ago
ZYFZYF / Face-beautification
Final project for the audio mini-class in the course "Digital Media (2): Multimedia": face beautification task
☆15 · Jul 14, 2022 · Updated 4 years ago
inclusionAI / MingTok-Audio
☆88 · Feb 24, 2026 · Updated 4 months ago
yukara-ikemiya / Open-Miipher-2
PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind
☆70 · Sep 22, 2025 · Updated 9 months ago
adelacvg / DPTTS
An AR+AR TTS attempt.
☆18 · Jan 13, 2025 · Updated last year
Vokturz / fast-embeddings-api
fast-embeddings-api
☆16 · Nov 23, 2023 · Updated 2 years ago
pengzhendong / streaming-tts-webui
Streaming Text to Speech Web UI
☆22 · May 6, 2024 · Updated 2 years ago
neosun100 / supertonic-tts-enhanced
Enhanced Supertonic TTS with Docker, FastAPI, Web UI, and comprehensive API documentation
☆21 · Dec 7, 2025 · Updated 7 months ago
wangzhaode / mnn-tts
mnn tts demo.
☆19 · May 7, 2025 · Updated last year
google-deepmind / librispeech-long
LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation …
☆98 · Dec 28, 2024 · Updated last year
ScottishFold007 / Cosyvoice_DPO_NOTES
CosyVoice_DPO_NOTES: Supercharge Your Cosyvoice model with Cutting-Edge DPO Fine-Tuning!
☆126 · Aug 8, 2025 · Updated 11 months ago
3loi / NaturalVoices
☆61 · Oct 22, 2025 · Updated 8 months ago
wyysf-98 / shapenet_render
blender scripts for shapenet
☆11 · Oct 12, 2020 · Updated 5 years ago
SonyResearch / VRVQ
Variable Bitrate Residual Vector Quantization for Audio Coding
☆54 · May 1, 2025 · Updated last year
av1d / NPU-Chat
Web chat front end for rk3588_npu_llm_server / RK3588 LLM chat interface
☆16 · Jul 16, 2024 · Updated 2 years ago
xphh / fireredasr-streaming
low-latency realtime ASR based on FireRedASR
☆62 · Jul 8, 2025 · Updated last year
lonzi / mrflow_dpo
☆22 · Jan 3, 2026 · Updated 6 months ago
AMAAI-Lab / DART
Demo for DART, Audio Imagination workshop submission in NeurIPS 2024
☆16 · Apr 22, 2026 · Updated 2 months ago
wutong8023 / SpeechRE
☆11 · Nov 11, 2022 · Updated 3 years ago
UniqueMR / Signal-Generator-Nexy4
Signal generator designed with Nexy4 FPGA
☆13 · May 14, 2023 · Updated 3 years ago
yrom / finetune-index-tts
IndexTTS Fine-tuning notebooks
☆138 · Jun 17, 2025 · Updated last year
Tencent / SongBench
☆50 · Apr 30, 2026 · Updated 2 months ago
haoweilou / ParaStyleTTS
This is the official code for ACM CIKM 2025 Paper: ParaStyleTTS: Toward Efficient and Robust Paralinguistic Style Control for Expressive …
☆59 · Dec 21, 2025 · Updated 7 months ago
SUHONGJIAN / Matlab-Visual-Processing-Face-Detection
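Face detection using skin-color color-space modeling, connected-component processing and analysis, and the Haar cascade method. (1) Builds multiple skin-color models and combines them with mathematical morphological filtering to perform face detection; (2) uses MATLAB's built-in Computer Vision System Toolbox to detect the faces of one or more people.
☆13 · Nov 23, 2018 · Updated 7 years ago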
lars76 / forced-alignment-chinese
Mandarin Chinese audio datasets aligned with Montreal Forced Aligner
☆19 · Aug 13, 2024 · Updated last year
hljodbokasafnid / Ascanius
Automates the creation of full-text (sound and text) ebooks in epub/epub3/daisy format, the webserver/client creates smil files to sync a…
☆10 · Nov 12, 2021 · Updated 4 years ago
Annafavaro / PARKCELEB
☆11 · Jun 13, 2026 · Updated last month