Inference Specialization
☆513Jun 25, 2024Updated 2 years ago
Alternatives and similar repositories for GPT-SoVITS-Inference
Users that are interested in GPT-SoVITS-Inference are comparing it to the libraries listed below.
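- This project aims to unify the way various speech synthesis engines are used. It supports adapters for multiple speech synthesis engines and can be used directly as a module or launched as a backend service.☆770Apr 15, 2024Updated 2 years ago
- [Minimal-configuration inference service, free of complex environment setup and integration packages] A pure inference service solution extracted from the GPT-SoVITS project.☆325Apr 11, 2024Updated 2 years ago
- A batch inference tool that runs inference on the same text multiple times, with support for randomized parameters, until the most satisfactory result is selected.☆11Aug 19, 2024Updated last year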
- A CLI tool for splitting audio by vocal timbre.☆292Jan 17, 2026Updated 5 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆59,114Jun 20, 2026Updated last week
- A GPT-SoVITS inference demo that automatically switches the reference audio based on sentiment analysis of Chinese text.☆107Mar 8, 2024Updated 2 years ago
- A simple VITS HTTP API, developed by extending Moegoe with additional features.☆1,047May 18, 2026Updated last month
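- An API interface for calling GPT-SoVITS.☆341Mar 7, 2024Updated 2 years ago
- A script for automatic batch emotion labeling of audio, based on Emotion2Vec.☆542Mar 7, 2025Updated last year
- Batch audition of GPT-SoVITS inference results across reference audio clips.☆52Mar 8, 2024Updated 2 years ago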
- Make audio books in one click! Let Genshin characters read novels for you!☆29Aug 2, 2024Updated last year
- Mainly documents the entire er-nerf deployment process from scratch.☆44Aug 28, 2024Updated last year
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆21,878May 25, 2026Updated last month
- vits2 backbone with multilingual-bert☆8,761Jun 22, 2026Updated last week
- GPT-SoVITS ONNX Inference Engine & Model Converter☆1,612Apr 18, 2026Updated 2 months ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.☆16Sep 18, 2024Updated last year
- Vocal Remover using Deep Neural Networks☆21Dec 31, 2024Updated last year
- Bert-VITS2 ONNX inference version☆44Apr 24, 2024Updated 2 years ago
- ☆26Mar 13, 2024Updated 2 years ago
- Speaker embedding for anime speech domain based on ECAPA_TDNN☆20Jun 22, 2025Updated last year
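- GPT-SoVITS-V2 model with some PRs from the official repository merged in, including but not limited to: automatic reference audio filling, subtitle synchronization, and SillyTavern integration.☆207Jan 15, 2025Updated last year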
- ☆66Jul 26, 2025Updated 11 months ago
- A lightweight tool that efficiently isolates target speaker data from your datasets.☆20Nov 23, 2024Updated last year
- Easily train a good VC model with voice data <= 10 mins!☆36,120Nov 24, 2024Updated last year
- This project provides a production-ready, real-time inference server for LatentSync, enabling high-quality, low-latency 2D digital human …☆26Aug 16, 2025Updated 10 months ago
- GAG is a GUI for GPT-SoVITS inference. Just add it to the official integration package and run for a smoother experience.☆238Jun 24, 2025Updated last year
- A generative speech model for daily dialogue.☆39,498Apr 10, 2026Updated 2 months ago
- SOTA Open Source TTS☆30,996Jun 9, 2026Updated 3 weeks ago
- Uses AI to identify speakers and segment the text, then feeds it into speech synthesis to automatically generate audiobooks.☆29Apr 3, 2025Updated last year
- Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-…☆18,699Updated this week
- VC Without Retrain!☆131Apr 27, 2024Updated 2 years ago
- VITS with phoneme-level prosody modeling based on MaskGIT☆85Aug 31, 2024Updated last year
- Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoreg…☆8,713Updated this week
- MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting☆6,061Sep 26, 2025Updated 9 months ago
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,842Jun 28, 2024Updated 2 years ago
- Genshin Datasets For SVC/SVS/TTS☆729Jan 11, 2026Updated 5 months ago
- ☆20Jun 7, 2024Updated 2 years ago
- AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/Wenda/Qianwen/kimi/ollama] that can run on [Bilibili/Douyin/…☆4,397Jul 29, 2025Updated 11 months ago
- GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code☆1,809Oct 18, 2024Updated last year