auspicious3000 / contentvec
speech self-supervised representations
☆477Updated last year
Alternatives and similar repositories for contentvec:
Users that are interested in contentvec are comparing it to the libraries listed below
- Soft speech units for voice conversion☆418Updated 10 months ago
- HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion☆343Updated 3 months ago
- VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer☆331Updated 2 months ago
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion☆235Updated last year
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆277Updated last year
- unofficial vits2-TTS implementation in pytorch☆505Updated 10 months ago
- Singing Voice Synthesis based on VITS, different from VISinger☆187Updated last year
- singing voice change based on whisper, and lora for singing voice clone☆630Updated last year
- Voice Conversion With Just Nearest Neighbors☆469Updated 10 months ago
- StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion☆491Updated 2 weeks ago
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform☆433Updated 2 years ago
- A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (…☆413Updated 2 years ago
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆234Updated 11 months ago
- How to use our public wav2vec2 dimensional emotion model☆476Updated last year
- PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)☆233Updated 2 years ago
- ☆253Updated last year
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…☆324Updated 2 years ago
- An opensource music processing toolkit☆311Updated last year
- Official Implementation of StyleTTS☆414Updated 2 weeks ago
- Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher☆178Updated last year
- ☆114Updated 3 months ago
- Easy-to-Use Speech MOS predictors☆256Updated last year
- Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code☆427Updated last year
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion☆619Updated last week
- text to speech using autoregressive transformer and VITS☆234Updated 9 months ago
- The Open Source Code of UniAudio☆540Updated 6 months ago
- Official PyTorch implementation of BigVGAN (ICLR 2023)☆942Updated 4 months ago
- ☆431Updated 2 months ago
- Pytorch implementation of the CREPE pitch tracker☆423Updated 7 months ago
- PPG-Based Voice Conversion☆332Updated 2 years ago