CosyVoice2 功能扩充(预训练音色推理/3s极速复刻/自然语言控制/自动识别/音色模型保存/API)
☆189Mar 13, 2025Updated 11 months ago
Alternatives and similar repositories for CosyVoice2-Ex
Users that are interested in CosyVoice2-Ex are comparing it to the libraries listed below
Sorting:
- 一个用于CosyVoice的api接口项目☆336Aug 31, 2025Updated 6 months ago
- 使用vllm加速cosyvoice2的推理☆486Apr 26, 2025Updated 10 months ago
- 内容审核及速率限制服务☆26May 18, 2025Updated 9 months ago
- A RESTful API extension for CosyVoice text-to-speech, with a web-based testing interface.☆17Mar 14, 2025Updated 11 months ago
- A Bob plugin that calls self-deployed Cosyvoice service to achieve TTS.☆39Aug 13, 2024Updated last year
- 自用,语音到文本用的sencevoice,llm部分基于ollama的API调用,文本到语音用的cosyvoice,实时语音输入参考的https://github.com/ABexit/ASR-LLM-TTS。☆12Dec 26, 2024Updated last year
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆19,913Feb 11, 2026Updated 3 weeks ago
- (MacOS Support) OpenAI compatible http server for Spark-TTS☆15May 1, 2025Updated 10 months ago
- API server for VibeVoice☆27Sep 28, 2025Updated 5 months ago
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.☆13Oct 22, 2024Updated last year
- Enhanced CosyVoice with one-click Windows installer, voice management WebUI, and a vLLM-accelerated OpenAI TTS API.☆23Aug 3, 2025Updated 7 months ago
- ☆33Feb 28, 2025Updated last year
- 基于FastAPI的语音服务系统,集成语音合成(TTS)和语音识别(STT)功能。使用CosyVoice2作为TTS引擎,FunASR作为STT引擎,支持零样本语音克隆、流式输出、多种语言识别等高级功能。☆20Apr 1, 2025Updated 11 months ago
- 使用 Python 制作简单视频 🎬☆17May 3, 2022Updated 3 years ago
- 一个简单的音频降噪工具,提高web UI界面和api接口☆44Nov 21, 2024Updated last year
- ☆20May 30, 2024Updated last year
- FastAPI Server Implementation for Bilibili Index TTS☆25Apr 13, 2025Updated 10 months ago
- [ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching☆1,253Mar 2, 2026Updated last week
- 🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.☆1,386Feb 3, 2026Updated last month
- Bert-VITS2_V202本地一键推理☆20Nov 23, 2023Updated 2 years ago
- AIGCPanel 是一个简单易用的一站式AI数字人系统,支持视频合成、声音合成、声音克隆,简化本地模型管理、一键导入和使用AI模型。☆4,646Feb 7, 2026Updated last month
- 安卓手机部署DeepSeek-R1 蒸馏的1.5B模型☆23Feb 4, 2025Updated last year
- one component for text to sonic video in ComfyUI☆25Apr 16, 2025Updated 10 months ago
- ☆146Jun 21, 2024Updated last year
- CosyVoice在Windows环境下使用的版本☆755Nov 19, 2024Updated last year
- 事件驱动的Unity行为树框架,附带基于GraphView的可视化编辑器与调试器☆34Jan 14, 2024Updated 2 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆129Apr 26, 2023Updated 2 years ago
- Explore how to get a VQ-VAE models efficiently!☆68Jul 24, 2025Updated 7 months ago
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆27Mar 5, 2024Updated 2 years ago
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆697Nov 27, 2025Updated 3 months ago
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆872Dec 2, 2025Updated 3 months ago
- A FastAPI service for text-to-speech synthesis using the F5-TTS model. Includes authentication token☆36Apr 25, 2025Updated 10 months ago
- PyQt6 1st try☆294Jan 5, 2025Updated last year
- Added vLLM support to IndexTTS for faster inference.☆1,075Updated this week
- Pseudo Streaming SenseVoice with Hotwords☆434Mar 13, 2025Updated 11 months ago
- ☆73Apr 17, 2025Updated 10 months ago
- A comprehensive WebUI Toolkit for Resemble-AI's Chatterbox☆23Jun 7, 2025Updated 9 months ago
- Common scripts, mainly for text processing and experimental control☆20Aug 24, 2012Updated 13 years ago
- A toolkit for speaker diarization.☆410Updated this week