RVC-Boss / GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
☆47,858 Updated this week
Alternatives and similar repositories for GPT-SoVITS
Users who are interested in GPT-SoVITS are comparing it to the libraries listed below.
- vits2 backbone with multilingual-bert ☆8,468 Updated last week
- A generative speech model for daily dialogue. ☆36,799 Updated 3 weeks ago
- SOTA Open Source TTS ☆21,914 Updated last week
- Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team … ☆13,233 Updated last month
- Translate a video from one language to another and add dubbing. Also supports speech-recognition transcription, speech synthesis, and subtitle translation. ☆13,080 Updated this week
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. ☆14,672 Updated last week
- A sound cloning tool with a web interface, using your voice or any sound to record audio ☆8,596 Updated 6 months ago
- A simple local web interface that uses ChatTTS to synthesize text into speech, with support for serving an external API. ☆7,134 Updated last month
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion ☆4,930 Updated 5 months ago
- LLM Frontend for Power Users. ☆15,287 Updated this week
- Generate high-definition short videos with one click using large AI models. ☆36,821 Updated last week
- Easily train a good VC model with voice data <= 10 mins! ☆30,118 Updated 6 months ago
- SoftVC VITS Singing Voice Conversion ☆27,237 Updated last year
- 🚀 AI voice mimicry: Clone a voice in 5 seconds to generate arbitrary speech in real-time ☆36,329 Updated 7 months ago
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key ☆8,464 Updated last month
- Bark Voice Cloning and Voice Cloning for Chinese Speech ☆2,938 Updated 2 months ago
- User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...) ☆35,368 Updated this week
- Multilingual Voice Understanding Model ☆5,951 Updated 2 months ago
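- LLM API management and distribution system supporting mainstream models including OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, ByteDance Doubao, ChatGLM, ERNIE Bot (Wenxin Yiyan), iFLYTEK Spark, Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan; unified API adaptation, usable for key … ☆25,679 Updated 4 months ago
- 🎤 Microsoft speech synthesis tool, built with Electron + Vue + ElementPlus + Vite. ☆6,004 Updated 2 months ago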
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity… ☆11,011 Updated 3 weeks ago
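- OCR software: free, open source, and offline. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Includes built-in multi-language libraries. ☆34,893 Updated 3 weeks ago
- 🍒 Cherry Studio is a desktop client that supports multiple LLM providers. ☆28,436 Updated this week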
- ☆39,679 Updated last month
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. ☆79,795 Updated this week
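- XiaoHongShu (RedNote) link extraction and post collection tool: extract links to an account's published, favorited, liked, and album posts; extract post and user links from search results; collect XiaoHongShu post information; extract post download addresses; download watermark-free post files ☆7,848 Updated this week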
- Stable Diffusion web UI ☆153,685 Updated last month
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio… ☆9,174 Updated 3 weeks ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆12,322 Updated last week
- ✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows ☆83,788 Updated this week