innnky / ar-vits
text to speech using autoregressive transformer and VITS
☆224Updated 5 months ago
Related projects: ⓘ
- Unoffical implementation of Megatts2☆255Updated 5 months ago
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion☆221Updated last year
- Preprocess Audio for training☆223Updated last month
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆225Updated 6 months ago
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei☆123Updated 10 months ago
- Singing Voice Synthesis based on VITS, different from VISinger☆182Updated 10 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆334Updated this week
- ☆98Updated 3 weeks ago
- ☆248Updated last year
- ☆181Updated last year
- Train the next generation of TTS systems.☆159Updated this week
- 基于PyTorch的VITS-BigVGAN的tts中文模型,加入韵律预测模型。☆192Updated 2 years ago
- The reproduced code for Google's SoundStorm☆241Updated 11 months ago
- ☆203Updated 7 months ago
- ☆92Updated last month
- CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone☆125Updated 5 months ago
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆131Updated 11 months ago
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"☆295Updated 2 weeks ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆274Updated last year
- GPT-SoVITS2☆167Updated last month
- unofficial vits2-TTS implementation in pytorch☆472Updated 5 months ago
- Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)☆211Updated this week
- Official Implementation of StyleTTS☆387Updated 9 months ago
- Application of MB-iSTFT-VITS components to vits2_pytorch☆107Updated 2 months ago
- A collection of neural vocoders suitable for singing voice synthesis tasks.☆92Updated last week
- VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design☆461Updated last year
- VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer☆309Updated 2 months ago
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆122Updated last year
- ☆139Updated 8 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆66Updated 11 months ago