Deepest-Project / MelNet
Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"
☆209Updated 8 months ago
Alternatives and similar repositories for MelNet:
Users that are interested in MelNet are comparing it to the libraries listed below
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆229Updated 2 years ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2)☆644Updated 4 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion☆334Updated last year
- A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS☆230Updated 5 years ago
- Multi-voice singing voice synthesis☆236Updated 2 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆239Updated 4 years ago
- WIP: Open Source Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"☆251Updated 5 years ago
- Implementation code of non-parallel sequence-to-sequence VC☆249Updated 2 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆319Updated 8 months ago
- A pytorch implementation of StarGAN-VC2☆147Updated 4 years ago
- A WaveRNN implementation☆199Updated 5 years ago
- Voice Conversion Challenge 2020 CycleVAE baseline system☆133Updated 4 years ago
- Authors' implementation of DeepSpeech Distances.☆129Updated 4 years ago
- Code to train and run Blow☆143Updated 5 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Updated 8 months ago
- A PyTorch implementation of "WaveFlow: A Compact Flow-based Model for Raw Audio" (ICML 2020)☆123Updated 8 months ago
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.☆402Updated 3 years ago
- Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source t…☆67Updated 3 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆368Updated 6 years ago
- ☆471Updated 4 years ago
- Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention☆201Updated 4 years ago
- Code for Unconditional Audio Generation with GAN and Cycle Regularization☆75Updated 3 years ago
- Official code for Cotatron @ INTERSPEECH 2020☆212Updated 8 months ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Updated 2 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆167Updated last year
- Official Code for Assem-VC @ICASSP2022☆266Updated 2 years ago
- A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis☆366Updated 2 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speech☆128Updated 4 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆129Updated 3 years ago
- Audio style transfer with shallow random parameters CNN.☆404Updated last month