r9y9 / nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
☆393Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for nnmnkwii
- A python wrapper for Speech Signal Processing Toolkit (SPTK).☆441Updated 4 months ago
- A Python wrapper for the high-quality vocoder "World"☆725Updated last year
- WaveNet-Vocoder implementation with pytorch.☆297Updated 4 years ago
- ☆149Updated 11 months ago
- PyTorch implementation of Tacotron speech synthesis model.☆309Updated 5 years ago
- End-2-end speech synthesis with recurrent neural networks☆225Updated 8 months ago
- PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)☆515Updated 4 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆368Updated 5 years ago
- ESPnet Model Zoo☆245Updated last year
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆237Updated 4 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆265Updated 2 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆465Updated 4 years ago
- A WaveRNN implementation☆198Updated 5 years ago
- Voice Conversion Tool Kit☆598Updated last year
- A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis☆362Updated last year
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion☆329Updated last year
- A vocoder framework which had been widely used in research community since 1999.☆176Updated 5 years ago
- Implementation code of non-parallel sequence-to-sequence VC☆250Updated last year
- A suite of speech signal processing tools☆226Updated this week
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆319Updated 3 months ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…☆855Updated last year
- INTERSPEECH 2019 Tutorial Materials☆193Updated 3 years ago
- Mel cepstral distortion (MCD) computations in python.☆213Updated 7 years ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆346Updated 4 months ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,569Updated 6 months ago
- see README☆327Updated 3 months ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆376Updated last year
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆224Updated 2 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆170Updated 3 months ago
- This is a pytorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial ne…☆515Updated 5 years ago