ProjectTUHs / SambertTTS-WebUI
WebUI built on SambertHifigan-TTS
☆11Updated 2 years ago
Alternatives and similar repositories for SambertTTS-WebUI
Users that are interested in SambertTTS-WebUI are comparing it to the libraries listed below
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆172Updated 11 months ago
- With this code, you can customize a digital-human avatar on a lightweight server without training a model☆106Updated last year
- ☆204Updated last year
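- A GPT-SoVITS inference demo that automatically switches the reference audio based on sentiment analysis of Chinese text☆105Updated last year
- A real-time voice conversation system based on Tongyi Qianwen Qwen2.5-Omni that uses an online API service and supports real-time voice interaction, dynamic voice activity detection, and streaming audio processing. …☆83Updated 8 months ago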
- ☆379Updated last year
- SenseVoice-python: An enterprise-grade, open-source, multi-language ASR system from FunASR, with onnxruntime☆108Updated 3 months ago
- Just a suturing monster project.☆42Updated 2 years ago
- ☆70Updated 2 years ago
- An API interface project for CosyVoice☆334Updated 5 months ago
- Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training wit…☆311Updated last month
- A tool for converting text corpora into training sets (txt to dataset)☆93Updated last year
- ChatTTS HTTP API☆54Updated last year
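- Optimizes the wav2lip pipeline by splitting it into three separate steps (head/face separation, mouth-shape replacement, and background restoration), adds GFPGAN face enhancement, and implements advance frame extraction, streaming loop processing, and OBS integration☆81Updated last year
- Speech recognition built on FunASR, with a regular version and an ONNX version (recommended).☆48Updated last year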
- Pseudo Streaming SenseVoice with Hotwords☆421Updated 10 months ago
- KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-…☆524Updated 2 years ago
- A small project that combines VAD, a voiceprint lock, and SenseVoice for near-real-time speech transcription☆42Updated last year
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆94Updated last week
- Accelerates CosyVoice2 inference with vLLM☆478Updated 9 months ago
- Documentation for Bert-VITS2☆22Updated 2 years ago
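- Speech recognition API with real-time recognition and offline upload recognition for long audio; supports real-time transcription and simultaneous interpretation for Chinese, English, and the languages of up to 100 countries☆81Updated last year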
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆47Updated 9 months ago
- Combines Wav2Lip and GFPGAN to produce high-definition talking digital-human videos☆37Updated 8 months ago
- Considering that the original Wav2Lip was trained on LRS2 and did not perform well on Chinese, I preprocessed the CMLR dataset and would t…☆63Updated 2 years ago
- ☆35Updated 2 years ago
- ☆42Updated 2 years ago
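- Mainly documents the complete ER-NeRF deployment process from scratch☆43Updated last year
- Standalone integrated WebUI for Bert-VITS2 transcription and annotation, integrating Alibaba FunASR, Bcut (必剪) ASR, and the Whisper large model☆184Updated last year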
- Utilizes ONNX Runtime to transcribe audio into text.☆78Updated this week