resemble-ai / MelNet
WIP: Open Source Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"
☆249Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for MelNet
- Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"☆208Updated 3 months ago
- Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu☆253Updated 3 years ago
- This repository has implementation for "Neural Voice Cloning With Few Samples"☆428Updated 3 years ago
- A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS☆229Updated 4 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆368Updated 5 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆120Updated 5 years ago
- PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)☆515Updated 4 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Updated 3 years ago
- Tensorflow Implementation of Expressive Tacotron☆197Updated 6 years ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2)☆637Updated 4 years ago
- Audio style transfer with shallow random parameters CNN.☆404Updated last year
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆464Updated 4 years ago
- End-2-end speech synthesis with recurrent neural networks☆225Updated 8 months ago
- Audio Denoising with Deep Network Priors☆163Updated 4 years ago
- Text to Speech with PyTorch (English and Mongolian)☆184Updated last month
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆237Updated 3 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆265Updated 2 years ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆227Updated 2 years ago
- A WaveNet-based vocoder for fast inference☆161Updated 6 years ago
- Voice Converter Using CycleGAN and Non-Parallel Data☆526Updated last year
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆358Updated last year
- PyTorch implementation of Tacotron speech synthesis model.☆308Updated 5 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆319Updated 3 months ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…☆854Updated last year
- ⏩ Generating speech in a single forward pass without any attention!☆579Updated 3 months ago
- Multi-voice singing voice synthesis☆235Updated last year
- A Pytorch Implementation of ClariNet☆289Updated 5 years ago
- A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis☆362Updated last year
- full tensorflow implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial netw…☆272Updated 7 months ago
- A WaveRNN implementation☆198Updated 5 years ago