tencent-ailab / learning_singing_from_speechLinks
Project page for our paper "DurIAN : DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System".
☆10Updated 4 years ago
Alternatives and similar repositories for learning_singing_from_speech
Users that are interested in learning_singing_from_speech are comparing it to the libraries listed below
Sorting:
- ☆30Updated 5 years ago
- demo page https://MingjieChen.github.io/dygan-vc☆67Updated 3 years ago
- Demo for 2022 ICASSP☆64Updated 3 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Updated 3 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Updated 2 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 4 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆70Updated 4 years ago
- Implementation of NWT, audio-to-video generation, in Pytorch☆91Updated 3 years ago
- Voice Conversion using Tacotron.☆11Updated 2 years ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆122Updated 2 years ago
- ☆25Updated 6 years ago
- Python implementation of the paper " Dynamic Temporal Alignment of Speech to Lips"☆32Updated 6 years ago
- Finally, some decent sample sentences☆23Updated last year
- TensorFlow implementation of "GANSynth: Adversarial Neural Audio Synthesis"☆67Updated 6 years ago
- [INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning☆82Updated 2 years ago
- Official PyTorch implementation of TTS Style Transfer☆24Updated 3 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Updated 5 years ago
- The project page repo for Neural Dubber.☆30Updated last year
- Speech-conditioned face generation using Generative Adversarial Networks☆88Updated 2 years ago
- An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".☆34Updated 4 years ago
- Real-Time High-Fidelity Speech Synthesis without GPU☆73Updated last year
- Code for Unconditional Audio Generation with GAN and Cycle Regularization☆77Updated 3 years ago
- ☆130Updated 2 years ago
- This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning"☆15Updated 2 years ago
- Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"☆21Updated 4 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Updated 4 years ago
- Official implementation of MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis (IEEE MLSP 2021)☆117Updated 3 years ago
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code☆109Updated 3 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Updated 5 years ago