twidddj / vqvae
Tensorflow implementation of VQVAE for voice conversion
☆12 · Updated 7 years ago
Alternatives and similar repositories for vqvae
Users that are interested in vqvae are comparing it to the libraries listed below
- ☆22 · Updated 4 years ago
- PyTorch implementation of: Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences ☆11 · Updated 5 years ago
- Voice Alignment and Conversion with Neural Networks and the WORLD codec. ☆20 · Updated 6 years ago
- Voice emotion conversion model for DS/ML master's thesis. F0 contour mapping in sequence-to-sequence RNN-LSTM architecture in Tensorflow. ☆27 · Updated 6 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion ☆40 · Updated 2 years ago
- Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016) ☆22 · Updated 7 years ago
- An evaluation toolkit for voice conversion models. ☆42 · Updated 3 years ago
- ☆51 · Updated 6 years ago
- ☆34 · Updated 5 years ago
- Using the WORLD vocoder to extract features and prepare data for training neural networks ☆11 · Updated 7 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by … ☆39 · Updated 2 years ago
- ☆22 · Updated 6 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder" ☆29 · Updated 4 years ago
- Voice conversion system ☆25 · Updated 4 years ago
- Text to Speech Synthesis based on controllable latent representation ☆14 · Updated 5 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese. ☆11 · Updated 6 years ago
- RawNet: Fast End-to-End Neural Vocoder ☆42 · Updated 6 years ago
- Contains code for our work on speech to singing conversion (ICASSP 2020) ☆50 · Updated 4 years ago
- This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training mor… ☆37 · Updated 2 years ago
- ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for… ☆43 · Updated 4 years ago
- Audio LPC (linear prediction code) using mel spectrogram, compatible with LPCNet ☆62 · Updated 3 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper (https://arxiv.org/abs/1903.03971) ☆39 · Updated 5 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system ☆36 · Updated 7 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp… ☆28 · Updated 5 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system ☆36 · Updated 5 years ago
- ☆15 · Updated 4 years ago
- Convert WSJ sphere format to waveform and do data simulation. ☆16 · Updated 5 years ago
- ☆12 · Updated last year
- Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding" ☆11 · Updated 2 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning. ☆22 · Updated 6 years ago