Inference Specialization
☆510Jun 25, 2024Updated last year
Alternatives and similar repositories for GPT-SoVITS-Inference
Users that are interested in GPT-SoVITS-Inference are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 本项目意图在于让使用各类语音合成引擎的方式变得统一,支持多种语音合成引擎适配器,允许直接作为模组使用或启动后端服务☆773Apr 15, 2024Updated last year
- 【脱离复杂的环境配置和整合包,极简配置推理服务】从GPT-SoVITS项目里面提取出来的,纯粹的推理服务方案。☆321Apr 11, 2024Updated last year
- 这是一个批量推理工具,对同一段文字进行多次推理,并且支持随机参数,直到筛选出最满意的结果。☆11Aug 19, 2024Updated last year
- A cli tool for split vocal timbre.☆281Jan 17, 2026Updated 2 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆56,367Feb 9, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 基于中文文本情绪分析自动切换参考音频的 GPT-SoVITS 推理 Demo☆107Mar 8, 2024Updated 2 years ago
- A simple VITS HTTP API, developed by extending Moegoe with additional features.☆1,047Oct 5, 2025Updated 6 months ago
- 适用于 GPT-SoVITS 的api调用接口☆334Mar 7, 2024Updated 2 years ago
- 一种基于Emotion2Vec的批量音频情感自动标注脚本☆523Mar 7, 2025Updated last year
- GPT-SoVITS 参考音频推理效果批量试 听☆53Mar 8, 2024Updated 2 years ago
- Make audio books in one click! Let Genshin characters read novels for you!☆30Aug 2, 2024Updated last year
- 主要写er-nerf从零到一所有部署过程☆44Aug 28, 2024Updated last year
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆20,400Mar 16, 2026Updated 3 weeks ago
- vits2 backbone with multilingual-bert☆8,720Updated this week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- GPT-SoVITS ONNX Inference Engine & Model Converter☆1,474Apr 1, 2026Updated last week
- Vocal Remover using Deep Neural Networks☆19Dec 31, 2024Updated last year
- This project provides a production-ready, real-time inference server for LatentSync, enabling high-quality, low-latency 2D digital human …☆23Aug 16, 2025Updated 7 months ago
- Bert-VITS2 onnx推理版本☆44Apr 24, 2024Updated last year
- ☆26Mar 13, 2024Updated 2 years ago
- GPT-SoVITS-V2模型,合并了官方的一些PR,包含但不限于:参考音频自动填充,字幕同步,SillyTavern酒馆接入等功能☆197Jan 15, 2025Updated last year
- 对接本地部署的GPT_SoVITS,为astrbot提供文本转语音(TTS)服务☆77Mar 15, 2026Updated 3 weeks ago
- GPT-SoVITS2☆229Feb 9, 2026Updated 2 months ago
- ☆67Jul 26, 2025Updated 8 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Easily train a good VC model with voice data <= 10 mins!☆35,140Nov 24, 2024Updated last year
- A lightweight tool that efficiently isolates target speaker data from your datasets.☆19Nov 23, 2024Updated last year
- GAG is a GUI for GPT-SoVITS inference. Just add it to the official integration package and run for a smoother experience.☆233Jun 24, 2025Updated 9 months ago
- SOTA Open Source TTS☆29,048Mar 30, 2026Updated last week
- A generative speech model for daily dialogue.☆39,019Jan 18, 2026Updated 2 months ago
- 通过AI实现对话者的识别并进行文段分割,再接入语音合成,自动生成有声小说☆28Apr 3, 2025Updated last year
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆15,544Mar 17, 2026Updated 3 weeks ago
- VC Without Retrain!☆129Apr 27, 2024Updated last year
- VITS with phoneme-level prosody modeling based on MaskGIT☆85Aug 31, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Multilingual Voice Understanding Model☆7,918Dec 30, 2025Updated 3 months ago
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,835Jun 28, 2024Updated last year
- MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting☆5,533Sep 26, 2025Updated 6 months ago
- ☆21Jun 7, 2024Updated last year
- AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/…☆4,329Jul 29, 2025Updated 8 months ago
- GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code☆1,810Oct 18, 2024Updated last year
- 低成本的简单基于live2d TTS文字转语音和大模型聊天的直播解决方案☆276Jul 4, 2024Updated last year