adasegroup / OSM-one-shot-multispeaker
Framework for one-shot multispeaker system based on Deep Learning
☆19 Updated 3 years ago
Alternatives and similar repositories for OSM-one-shot-multispeaker:
Users who are interested in OSM-one-shot-multispeaker are comparing it to the libraries listed below.
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis ☆56 Updated 3 years ago
- ☆15 Updated 3 years ago
- Training code and trained checkpoints for ASGAN. ☆62 Updated last year
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration ☆34 Updated 3 years ago
- ☆36 Updated 3 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling ☆37 Updated 3 years ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion ☆49 Updated last year
- One-shot TTS with Improved Unseen Speaker and Style Transfer ☆37 Updated 2 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks ☆20 Updated 3 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN… ☆74 Updated 2 years ago
- Demo audio of VARA-TTS model ☆20 Updated 3 years ago
- Temporary anonymous version ☆22 Updated 10 months ago
- ☆11 Updated last year
- with alignment learning and continuous wavelet transform ☆20 Updated 2 years ago
- ☆26 Updated 10 months ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106 ☆46 Updated 3 years ago
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022) ☆54 Updated 10 months ago
- Implementation for "Music Enhancement via Image Translation and Vocoding" ☆53 Updated 2 years ago
- ☆30 Updated 2 years ago
- ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for… ☆43 Updated 4 years ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch ☆24 Updated 3 years ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech ☆22 Updated 2 years ago
- follow NVIDIA, simplify it and support data parallel. ☆13 Updated 5 years ago
- A unified model for zero-shot singing voice conversion and synthesis ☆21 Updated 2 years ago
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech" ☆15 Updated 9 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models. ☆76 Updated last year
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt… ☆16 Updated 4 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition" ☆17 Updated 4 years ago
- A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS ☆33 Updated last month