收集有关so-vits-svc、TTS、SD、LLMs的各种模型、应用以及文字、声音、图片、视频有关的model。
☆206Jun 7, 2025Updated 9 months ago
Alternatives and similar repositories for so-vits-models
Users that are interested in so-vits-models are comparing it to the libraries listed below
Sorting:
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- So-VITS-SVC 本地部署使用帮助文档,提供Colab笔记本 So-VITS-SVC Local Deployment Document and provide Colab notebook☆748Mar 31, 2025Updated 11 months ago
- 基于PyQt5写的一个音频响度匹配小工具,目前支持4种匹配方式☆10Aug 14, 2025Updated 6 months ago
- VITS2 using Phoneme-Level Japanese BERT☆14Dec 17, 2023Updated 2 years ago
- ☆12Oct 20, 2023Updated 2 years ago
- Official Implementation of "GEAR: Augmenting Language Models with Generalizable and Efficient Tool Resolution"☆20Apr 3, 2024Updated last year
- A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech☆11Aug 12, 2020Updated 5 years ago
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- a custom node for separation vocals from music based on Music-Source-Separation-Training☆25Oct 24, 2024Updated last year
- Core Engine of Singing Voice Conversion & Singing Voice Clone☆2,848Apr 23, 2024Updated last year
- SoftVC VITS Singing Voice Conversion☆28,014Nov 11, 2023Updated 2 years ago
- Text to Speech Synthesis based on controllable latent representation☆14Aug 30, 2019Updated 6 years ago
- 基于GptSoVits项目的参考音频筛选工具☆23Aug 17, 2025Updated 6 months ago
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆184Jul 10, 2024Updated last year
- Daily tracking of awesome aigc papers, including video generation, video editing, animation.☆24Aug 20, 2025Updated 6 months ago
- ☆41Feb 28, 2024Updated 2 years ago
- a samplers scheduler for stable diffusion webui version 1.6.x☆23Dec 23, 2023Updated 2 years ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆54Oct 31, 2023Updated 2 years ago
- ☆298May 22, 2024Updated last year
- ☆22Nov 9, 2024Updated last year
- [PyTorch] Minimal codebase for MusicGen models☆63Jan 7, 2025Updated last year
- ☆26Jun 28, 2024Updated last year
- Chat with your RVC models. See website for demo:☆22Feb 15, 2024Updated 2 years ago
- ☆25Mar 6, 2024Updated 2 years ago
- A set of custom nodes for ComfyUI to set and combine scripts for text-2-video production.☆64Mar 1, 2025Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- This repository is a collection of ComfyUI nodes and workflows that can facilitate the creation of animations and video compilations. It …☆33Oct 21, 2025Updated 4 months ago
- Superprompt a 77M Parameter T5 custom trained checkpoint to make dull prompts detailed.☆69May 23, 2024Updated last year
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆61Apr 4, 2024Updated last year
- Custom nodes for ComfyUI to run AuraSR models.☆28Sep 2, 2025Updated 6 months ago
- Lets make video diffusion practical! Adding Start and end frame control to Framepack☆32Apr 20, 2025Updated 10 months ago
- Markdown to Telegram MarkdownV2 Converter☆12Jul 15, 2024Updated last year
- ☆33Nov 28, 2023Updated 2 years ago
- [二测]星穹铁道语音☆29Aug 21, 2022Updated 3 years ago
- ComfyUI node pack by cerspense☆36Dec 29, 2025Updated 2 months ago
- ☆41May 15, 2023Updated 2 years ago
- Bilingual-TTS (Japanese and Korean)☆32Jul 1, 2023Updated 2 years ago
- Port of Funasr's Paraformer model in C/C++☆39Jun 19, 2024Updated last year