收集有关so-vits-svc、TTS、SD、LLMs的各种模型、应用以及文字、声音、图片、视频有关的model。
☆207Jun 7, 2025Updated 9 months ago
Alternatives and similar repositories for so-vits-models
Users that are interested in so-vits-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于PyQt5写的一个音频响度匹配小工具,目前支持4种匹配方式☆10Aug 14, 2025Updated 7 months ago
- So-VITS-SVC 本地部署使用帮助文档,提供Colab笔记本 So-VITS-SVC Local Deployment Document and provide Colab notebook☆747Mar 31, 2025Updated 11 months ago
- 用DiffSinger让bot唱歌☆15Apr 5, 2024Updated last year
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- ☆12Oct 20, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- SoftVC VITS Singing Voice Conversion☆28,046Nov 11, 2023Updated 2 years ago
- 基于GptSoVits项目的参考音频筛选工具☆23Aug 17, 2025Updated 7 months ago
- AIStarter是一款免费的AI项目管理平台,旨在让用户能够在Windows、Mac或Linux上快速轻松地下载、安装和分享各类热门AI开源项目。☆21Oct 17, 2025Updated 5 months ago
- a custom node for separation vocals from music based on Music-Source-Separation-Training☆26Oct 24, 2024Updated last year
- [PyTorch] Minimal codebase for MusicGen models☆63Jan 7, 2025Updated last year
- Self-supervised key estimation model that matches performance with supervised state-of-the-art model.☆48Jun 9, 2025Updated 9 months ago
- Multilingual-Speech-Synthesis-Voice-Conversion Using Bark + RVC☆14Apr 19, 2025Updated 11 months ago
- A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech☆11Aug 12, 2020Updated 5 years ago
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆298May 22, 2024Updated last year
- Official Implementation of "GEAR: Augmenting Language Models with Generalizable and Efficient Tool Resolution"☆20Apr 3, 2024Updated last year
- Core Engine of Singing Voice Conversion & Singing Voice Clone☆2,852Apr 23, 2024Updated last year
- Pytorch project accompanying the paper "Comparing Deep Models and Evaluation Strategies for Multi-Pitch Estimation in Music Recordings", …☆13Aug 26, 2022Updated 3 years ago
- framepack,但是webui插件(加了点功能)☆22May 10, 2025Updated 10 months ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆61Apr 4, 2024Updated last year
- a samplers scheduler for stable diffusion webui version 1.6.x☆23Dec 23, 2023Updated 2 years ago
- Chat with your RVC models. See website for demo:☆22Feb 15, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆14Sep 13, 2022Updated 3 years ago
- Colab Notebook for SeamlessM4T model by Meta☆10Aug 23, 2023Updated 2 years ago
- Easily train a good VC model with voice data <= 10 mins!☆34,976Nov 24, 2024Updated last year
- Playwright-based Pixiv OAuth code & token fetcher☆27Apr 23, 2025Updated 11 months ago
- ☆25Mar 6, 2024Updated 2 years ago
- Code for ChordSync, a conformer-based audio-to-chord synchroniser☆14Oct 17, 2025Updated 5 months ago
- ☆41Feb 28, 2024Updated 2 years ago
- This is a repo for CVPR 2022 Paper with Code☆10Apr 13, 2022Updated 3 years ago
- ttsmaker is a Text-to-Speech library implemented using the TTSMaker API.☆14Apr 25, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Text to Speech Synthesis based on controllable latent representation☆14Aug 30, 2019Updated 6 years ago
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆18Aug 8, 2024Updated last year
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024☆16Nov 19, 2024Updated last year
- Knox is a vigilant supervisor and management tool that ensures LLM teams rigorously develop reliable AI Agent programming extensions for …☆37Updated this week
- ☆41May 15, 2023Updated 2 years ago
- vits2 backbone with multilingual-bert☆8,717Mar 23, 2026Updated last week
- ☆22Nov 9, 2024Updated last year