Inference Specialization
☆512Jun 25, 2024Updated last year
Alternatives and similar repositories for GPT-SoVITS-Inference
Users that are interested in GPT-SoVITS-Inference are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 本项目意图在于让使用各类语音合成引擎的方式变得统一,支持多种语音合成引擎适配器,允许直接作为模组使用或启动后端服务☆771Apr 15, 2024Updated 2 years ago
- 【脱离复杂的环境配置和整合包,极简配置推理服务】从GPT-SoVITS项目里面提取出来的,纯粹的推理服务方案。☆326Apr 11, 2024Updated 2 years ago
- 这是一个批量推理工具,对同一段文字进行多次推理,并且支持随机参数,直到筛选出最满意的结果。☆11Aug 19, 2024Updated last year
- A cli tool for split vocal timbre.☆288Jan 17, 2026Updated 4 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆57,541Apr 30, 2026Updated 3 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 基于中文文本情绪分析自动切换参考音频的 GPT-SoVITS 推理 Demo☆107Mar 8, 2024Updated 2 years ago
- A simple VITS HTTP API, developed by extending Moegoe with additional features.☆1,048Oct 5, 2025Updated 7 months ago
- 适用于 GPT-SoVITS 的api调用接口☆340Mar 7, 2024Updated 2 years ago
- 一种基于Emotion2Vec的批量音频情感自动标注脚本☆531Mar 7, 2025Updated last year
- GPT-SoVITS 参考音频推理效果批量试听☆52Mar 8, 2024Updated 2 years ago
- Make audio books in one click! Let Genshin characters read novels for you!☆29Aug 2, 2024Updated last year
- 主要写er-nerf从零到一所有部署过程☆44Aug 28, 2024Updated last year
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆21,070May 3, 2026Updated 2 weeks ago
- GPT-SoVITS ONNX Inference Engine & Model Converter☆1,543Apr 18, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- vits2 backbone with multilingual-bert☆8,743Apr 27, 2026Updated 3 weeks ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆16Sep 18, 2024Updated last year
- Vocal Remover using Deep Neural Networks☆19Dec 31, 2024Updated last year
- Bert-VITS2 onnx推理版本☆44Apr 24, 2024Updated 2 years ago
- ☆26Mar 13, 2024Updated 2 years ago
- Speaker embedding for anime speech domain based on ECAPA_TDNN☆19Jun 22, 2025Updated 10 months ago
- GPT-SoVITS-V2模型,合并了官方的一些PR,包含但不限于:参考音频自动填充,字幕同步,SillyTavern酒馆接入等功能☆199Jan 15, 2025Updated last year
- ☆66Jul 26, 2025Updated 9 months ago
- 对接本地部署的GPT_SoVITS,为astrbot提供文本转语音(TTS)服务☆85Mar 15, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Easily train a good VC model with voice data <= 10 mins!☆35,638Nov 24, 2024Updated last year
- A lightweight tool that efficiently isolates target speaker data from your datasets.☆20Nov 23, 2024Updated last year
- This project provides a production-ready, real-time inference server for LatentSync, enabling high-quality, low-latency 2D digital human …☆24Aug 16, 2025Updated 9 months ago
- GAG is a GUI for GPT-SoVITS inference. Just add it to the official integration package and run for a smoother experience.☆236Jun 24, 2025Updated 10 months ago
- A generative speech model for daily dialogue.☆39,277Apr 10, 2026Updated last month
- SOTA Open Source TTS☆30,356May 12, 2026Updated last week
- 通过AI实现对话者的识别并进行文段分割,再接入语音合成,自动生成有声小说☆29Apr 3, 2025Updated last year
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆16,101Mar 17, 2026Updated 2 months ago
- VC Without Retrain!☆130Apr 27, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- VITS with phoneme-level prosody modeling based on MaskGIT☆85Aug 31, 2024Updated last year
- Multilingual Voice Understanding Model☆8,161Dec 30, 2025Updated 4 months ago
- MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting☆5,765Sep 26, 2025Updated 7 months ago
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,841Jun 28, 2024Updated last year
- Genshin Datasets For SVC/SVS/TTS☆721Jan 11, 2026Updated 4 months ago
- ☆20Jun 7, 2024Updated last year
- AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/…☆4,367Jul 29, 2025Updated 9 months ago