CMsmartvoice / One-Shot-Voice-Cloning
One Shot Voice Cloning base on Unet-TTS
☆240Updated 2 years ago
Alternatives and similar repositories for One-Shot-Voice-Cloning:
Users that are interested in One-Shot-Voice-Cloning are comparing it to the libraries listed below
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆190Updated 2 years ago
- Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher☆178Updated last year
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆234Updated 11 months ago
- StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion☆494Updated last month
- PPG-Based Voice Conversion☆332Updated 2 years ago
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion☆236Updated last year
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆359Updated last year
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆344Updated 2 years ago
- Singing Voice Synthesis based on VITS, different from VISinger☆188Updated last year
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆295Updated 3 years ago
- AdaSpeech: Adaptive Text to Speech for Custom Voice☆156Updated 3 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆199Updated 2 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech☆242Updated 3 years ago
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion☆649Updated last month
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei☆141Updated last year
- An 16kHz implementation of HiFi-GAN for soft-vc.☆96Updated last year
- HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion☆344Updated 4 months ago
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆140Updated last year
- Official Implementation of StyleTTS☆418Updated last month
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…☆324Updated 2 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆276Updated last year
- Voice Conversion by CycleGAN (语音克隆/语音转换):CycleGAN-VC3☆141Updated 2 years ago
- VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer☆333Updated 3 months ago
- Unoffical implementation of Megatts2☆276Updated 10 months ago
- text to speech using autoregressive transformer and VITS☆234Updated 10 months ago
- Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention☆201Updated 4 years ago
- PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)☆235Updated 3 years ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…☆159Updated 3 years ago
- The reproduced code for Google's SoundStorm☆264Updated last year
- This is the GitHub page for publicly available emotional speech data.☆336Updated 3 years ago