sekift / so-vits-models
A collection of models and applications related to so-vits-svc, TTS, SD, and LLMs, as well as models for text, audio, images, and video.
☆205 Jun 7, 2025 Updated 8 months ago
Alternatives and similar repositories for so-vits-models
Users that are interested in so-vits-models are comparing it to the libraries listed below
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b… ☆14 Dec 30, 2023 Updated 2 years ago
- So-VITS-SVC local deployment help documentation, with a Colab notebook provided ☆747 Mar 31, 2025 Updated 10 months ago
- Lets any character whose source material can be found on Bilibili sing any song that can be found on Bilibili ☆20 Jun 16, 2024 Updated last year
- A small audio loudness-matching tool written with PyQt5; currently supports 4 matching modes ☆10 Aug 14, 2025 Updated 5 months ago
- Make a bot sing using DiffSinger ☆15 Apr 5, 2024 Updated last year
- VITS2 using Phoneme-Level Japanese BERT ☆14 Dec 17, 2023 Updated 2 years ago
- ☆12 Oct 20, 2023 Updated 2 years ago
- A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech ☆11 Aug 12, 2020 Updated 5 years ago
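- AIStarter is a free AI project management platform that lets users quickly and easily download, install, and share popular open-source AI projects on Windows, Mac, or Linux. ☆20 Oct 17, 2025 Updated 3 months ago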
- 60k hours of phoneme-aligned audio from audio books ☆19 Jul 27, 2024 Updated last year
- framepack, but as a webui plugin (with a few added features) ☆21 May 10, 2025 Updated 9 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆30 May 27, 2023 Updated 2 years ago
- Core Engine of Singing Voice Conversion & Singing Voice Clone ☆2,847 Apr 23, 2024 Updated last year
- Text to Speech Synthesis based on controllable latent representation ☆14 Aug 30, 2019 Updated 6 years ago
- Standalone integrated WebUI for Bert-vits2 transcription and annotation, integrating Alibaba FunAsr, Bcut (必剪) Asr, and the Whisper large model ☆184 Jul 10, 2024 Updated last year
- ☆41 Feb 28, 2024 Updated last year
- Daily tracking of awesome aigc papers, including video generation, video editing, animation. ☆24 Aug 20, 2025 Updated 5 months ago
- a samplers scheduler for stable diffusion webui version 1.6.x ☆23 Dec 23, 2023 Updated 2 years ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2 ☆54 Oct 31, 2023 Updated 2 years ago
- ☆298 May 22, 2024 Updated last year
- ☆23 Nov 9, 2024 Updated last year
- ☆27 Jun 28, 2024 Updated last year
- ☆25 Mar 6, 2024 Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*) ☆71 Nov 10, 2023 Updated 2 years ago
- This repository is a collection of ComfyUI nodes and workflows that can facilitate the creation of animations and video compilations. It … ☆33 Oct 21, 2025 Updated 3 months ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (… ☆61 Apr 4, 2024 Updated last year
- Superprompt a 77M Parameter T5 custom trained checkpoint to make dull prompts detailed. ☆69 May 23, 2024 Updated last year
- Custom nodes for ComfyUI to run AuraSR models. ☆28 Sep 2, 2025 Updated 5 months ago
- Let's make video diffusion practical! Adding start and end frame control to Framepack ☆32 Apr 20, 2025 Updated 9 months ago
- Markdown to Telegram MarkdownV2 Converter ☆13 Jul 15, 2024 Updated last year
- ☆33 Nov 28, 2023 Updated 2 years ago
- [Second beta test] Honkai: Star Rail voice audio ☆29 Aug 21, 2022 Updated 3 years ago
- ☆41 May 15, 2023 Updated 2 years ago
- ComfyUI node pack by cerspense ☆36 Dec 29, 2025 Updated last month
- Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491). ☆129 Jul 12, 2024 Updated last year
- Bilingual-TTS (Japanese and Korean) ☆32 Jul 1, 2023 Updated 2 years ago
- Easily train a good VC model with voice data <= 10 mins! ☆34,414 Nov 24, 2024 Updated last year
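- Zenless Zone Zero one-stop automation | fully automatic | auto-dodge | auto dailies | auto Hollow runs | controller support (for the 1.4 game update, please wait patiently for adaptation) ☆15 Updated this week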
- Text frontend for ESPnet tts recipes ☆34 Jun 1, 2021 Updated 4 years ago