tencent-ailab / learning_singing_from_speech
Project page for our paper "DurIAN : DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System".
☆10 Updated 5 years ago
Alternatives and similar repositories for learning_singing_from_speech
Users who are interested in learning_singing_from_speech are comparing it to the repositories listed below.
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach ☆20 Updated 4 years ago
- Voice Conversion using Tacotron. ☆11 Updated 3 years ago
- ☆30 Updated 5 years ago
- Implementation of NWT, audio-to-video generation, in Pytorch ☆92 Updated 3 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion ☆40 Updated 3 years ago
- Real-Time High-Fidelity Speech Synthesis without GPU ☆73 Updated last year
- demo page https://MingjieChen.github.io/dygan-vc ☆67 Updated 3 years ago
- Finally, some decent sample sentences ☆23 Updated 2 years ago
- ☆23 Updated 2 years ago
- Demo for 2022 ICASSP ☆64 Updated 3 years ago
- Contrastive Language-Audio Pretraining ☆15 Updated 4 years ago
- ☆10 Updated last year
- Official PyTorch implementation of TTS Style Transfer ☆25 Updated 3 years ago
- Speech-conditioned face generation using Generative Adversarial Networks ☆88 Updated 3 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer ☆37 Updated 3 years ago
- Real-time melgan based on cpu !!! ☆13 Updated 6 years ago
- Voice conversion training with 109 speakers with limited training samples ☆35 Updated 5 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System ☆35 Updated 5 years ago
- A Pytorch Implementation of MelNet ☆26 Updated 5 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration ☆34 Updated 4 years ago
- This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631 ☆23 Updated 3 years ago
- Code base for WaveTransformer: A novel architecture for automated audio captioning ☆44 Updated 4 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an… ☆46 Updated 7 years ago
- ☆13 Updated 2 years ago
- Code for Unconditional Audio Generation with GAN and Cycle Regularization ☆77 Updated 4 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech" ☆64 Updated 2 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data ☆70 Updated 4 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction. ☆89 Updated 4 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the… ☆47 Updated 3 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp… ☆58 Updated 6 years ago