Deepest-Project / MelNet
Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"
☆211Updated 9 months ago
Alternatives and similar repositories for MelNet:
Users who are interested in MelNet are comparing it to the libraries listed below
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆239Updated 4 years ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆228Updated 3 years ago
- A PyTorch implementation of GAN-TTS: High Fidelity Speech Synthesis with Adversarial Networks☆231Updated 5 years ago
- Voice Conversion Challenge 2020 CycleVAE baseline system☆133Updated 4 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion☆335Updated last year
- Code to train and run Blow☆143Updated 5 years ago
- Authors' implementation of DeepSpeech Distances.☆129Updated 5 years ago
- Implementation code of non-parallel sequence-to-sequence VC☆249Updated 2 years ago
- A PyTorch implementation of StarGAN-VC2☆147Updated 4 years ago
- Multi-voice singing voice synthesis☆237Updated 2 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Updated 3 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Updated 9 months ago
- A PyTorch implementation of "WaveFlow: A Compact Flow-based Model for Raw Audio" (ICML 2020)☆123Updated 9 months ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆319Updated 9 months ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2)☆644Updated 4 years ago
- A WaveRNN implementation☆199Updated 5 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆125Updated 6 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆132Updated 4 years ago
- ☆130Updated 2 years ago
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.☆402Updated 3 years ago
- Official code for Cotatron @ INTERSPEECH 2020☆213Updated 9 months ago
- WIP: Open Source Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"☆251Updated 5 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆167Updated last year
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference☆51Updated 5 years ago
- A TensorFlow implementation of "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆367Updated 6 years ago
- Official Code for Assem-VC @ICASSP2022☆266Updated 2 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Updated 4 years ago
- Implementation of the Speaker Odyssey 2020 paper "Transforming spectrum and prosody for emotional voice conversion with non-…☆125Updated 4 years ago
- A PyTorch implementation of MelGAN☆67Updated 5 years ago
- Parallel WaveNet based on NSynth☆107Updated 6 years ago