VC Without Retrain!
☆129Apr 27, 2024Updated last year
Alternatives and similar repositories for GPT-SoVITS-VC
Users who are interested in GPT-SoVITS-VC are comparing it to the libraries listed below.
- GPT-SoVITS2☆229Feb 9, 2026Updated last month
- ☆26Mar 20, 2024Updated 2 years ago
- text to speech using autoregressive transformer and VITS☆248Apr 3, 2024Updated last year
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆237Feb 29, 2024Updated 2 years ago
- Audio loudness unification and volume normalization☆13May 3, 2024Updated last year
- a lightweight voice conversion☆86Feb 25, 2026Updated last month
- ☆298May 22, 2024Updated last year
- Huawei Grad-TTS for Chinese☆51Sep 26, 2023Updated 2 years ago
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆80May 29, 2023Updated 2 years ago
- ☆12Nov 7, 2024Updated last year
- Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios"(AAAI 2026)☆94Jan 31, 2026Updated last month
- ☆39Oct 1, 2023Updated 2 years ago
- Easy-to-Use Speech MOS predictors☆349Oct 24, 2023Updated 2 years ago
- An Open-Sourced LLM-empowered Foundation TTS System☆907Sep 28, 2025Updated 6 months ago
- An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)☆130Jul 30, 2024Updated last year
- ☆51May 1, 2024Updated last year
- ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis☆155Sep 20, 2024Updated last year
- Self-supervised Generative LM-based Voice Conversion☆55Apr 24, 2025Updated 11 months ago
- The power-law compressed phase-aware asymmetric (PLCPA-ASYM) loss☆14Sep 4, 2023Updated 2 years ago
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion☆259Jul 13, 2023Updated 2 years ago
- With this code, you can customize a digital human avatar on a lightweight server without training a model☆106Mar 27, 2024Updated 2 years ago
- Simple data labeling script with funasr inside. Labels VITS training data using Alibaba's funasr☆80Oct 10, 2023Updated 2 years ago
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆58Nov 10, 2025Updated 4 months ago
- ☆59Jun 28, 2024Updated last year
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆435Sep 13, 2024Updated last year
- A simple python wrapper for gpupixel using SourceRawDataInput and TargetRawDataOutput.☆11Aug 14, 2024Updated last year
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for V…☆243Jul 31, 2024Updated last year
- VoiceBox neural network implementation☆110Aug 2, 2024Updated last year
- ☆28Oct 1, 2023Updated 2 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- All generative model in one for better TTS model☆74Sep 8, 2024Updated last year
- Official Code for ParrotTTS☆58Oct 13, 2024Updated last year
- unofficial vits2-TTS implementation in pytorch☆548Mar 28, 2024Updated 2 years ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- VITS with phoneme-level prosody modeling based on MaskGIT☆85Aug 31, 2024Updated last year
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆61Apr 4, 2024Updated last year
- ☆11Feb 20, 2025Updated last year
- Vocoder based on PC-DDSP and nsf-HiFiGAN☆18Jul 17, 2023Updated 2 years ago