jaywalnut310 / vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
☆7,279Updated last year
Alternatives and similar repositories for vits:
Users that are interested in vits are comparing it to the libraries listed below
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion☆4,864Updated 2 months ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆4,431Updated last week
- Core Engine of Singing Voice Conversion & Singing Voice Clone☆2,749Updated 11 months ago
- Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)☆2,085Updated last month
- 无需情感标注的情感可控语音合成模型,基于VITS☆1,372Updated 2 years ago
- An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singi…☆2,820Updated this week
- Singing Voice Conversion via diffusion model☆2,668Updated last year
- VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai☆923Updated last year
- vits2 backbone with multilingual-bert☆8,345Updated last week
- Executable file for VITS inference☆2,376Updated last year
- 多个SVC/TTS的C++推理库☆1,061Updated last month
- Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!☆1,180Updated last year
- SoftVC VITS Singing Voice Conversion☆26,807Updated last year
- A simple GUI application that slices audio with silence detection☆1,324Updated 8 months ago
- 基于vits与softvc的歌声音色转换模型☆3,679Updated 5 months ago
- Bark Voice Cloning and Voice Cloning for Chinese Speech☆2,881Updated this week
- PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html☆2,108Updated last year
- so-vits-svc fork with realtime support, improved interface and more features.☆8,946Updated 2 weeks ago
- Python script that slices audio with silence detection☆808Updated 9 months ago
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆12,532Updated 9 months ago
- An easy to understand TTS / SVS / SVC framework☆691Updated 3 weeks ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆2,082Updated 8 months ago
- Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch☆1,316Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆43,162Updated this week
- An unofficial PyTorch implementation of the audio LM VALL-E☆2,991Updated last year
- Fine-Tuning your VITS model using a pre-trained model☆554Updated last year
- Speech synthesis model /inference GUI repo for galgame characters based on Tacotron2, Hifigan, VITS and Diff-svc☆985Updated 2 years ago
- Easily train a good VC model with voice data <= 10 mins!☆28,227Updated 4 months ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"☆1,984Updated last year
- So-VITS-SVC 本地部署使用帮助文档,提供Colab笔记本 So-VITS-SVC Local Deployment Document and provide Colab notebook☆697Updated 2 weeks ago