using world vocoder to extract features and make data for training neural networks
☆11Oct 9, 2017Updated 8 years ago
Alternatives and similar repositories for extract_features_using_world
Users that are interested in extract_features_using_world are comparing it to the libraries listed below
Sorting:
- Chinese Prosodic Structure Prediction☆10May 18, 2019Updated 6 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Jan 30, 2019Updated 7 years ago
- Interface for running Praat scripts through Python☆17May 16, 2025Updated 9 months ago
- Mel-Generalized Cepstrum analysis☆20Jul 21, 2017Updated 8 years ago
- Audio streaming transfer demo with google.api.HttpBody and grpc gateway for speech synthesis☆20Jan 28, 2020Updated 6 years ago
- Speech synthesis platform based on tensorflow and sonnet☆60May 16, 2019Updated 6 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Oct 14, 2019Updated 6 years ago
- List of papers about TTS / Список статей о TTS☆10Dec 16, 2017Updated 8 years ago
- A Text2Speech Engine built in Pytorch.☆12Dec 9, 2018Updated 7 years ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- tts fronted-end☆11Dec 19, 2018Updated 7 years ago
- 一个开源的中文歌声合成数据集。An open-source Chinese singing synthesizing dataset.☆24Jul 13, 2019Updated 6 years ago
- ☆45Dec 16, 2019Updated 6 years ago
- High-level API for tar-based dataset☆12Feb 3, 2024Updated 2 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆52Jun 17, 2025Updated 8 months ago
- ☆51Feb 15, 2019Updated 7 years ago
- ICASSP2022 TTS&VC Summary☆14Jun 9, 2022Updated 3 years ago
- Voice conversion (VC) investigation using three variants of VAE☆59Oct 28, 2019Updated 6 years ago
- VAE Tacotron 2, an alternative of GST Tacotron☆90Jul 6, 2023Updated 2 years ago
- Code for "Distribution-based Emotion Recognition in Conversation"☆19Feb 6, 2023Updated 3 years ago
- ☆34Jul 16, 2019Updated 6 years ago
- WaveGlow vocoder with VQVAE☆61Jun 18, 2019Updated 6 years ago
- pytorch implementation of DNN-HSMM for TTS☆69Mar 14, 2021Updated 4 years ago
- ☆18Jan 17, 2022Updated 4 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- Noise generators for vocoder☆19Dec 31, 2018Updated 7 years ago
- ☆20Jun 5, 2022Updated 3 years ago
- Audio samples from ICML2019 "Almost Unsupervised Text to Speech and Automatic Speech Recognition"☆17May 14, 2019Updated 6 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Dec 1, 2021Updated 4 years ago
- A collection of examples demonstrating how we can build speech synthesis systems using nnmnkwii.☆71May 15, 2020Updated 5 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆71Mar 19, 2021Updated 4 years ago
- ☆69Mar 31, 2021Updated 4 years ago
- An implementation of Tacotron and Tacotron2☆80Aug 4, 2021Updated 4 years ago
- Mutiband version of HIFIGAN☆19Nov 6, 2020Updated 5 years ago
- Compressed version of Tacotron 2 using Tensor Train + Waveglow.☆22Dec 26, 2019Updated 6 years ago
- Phone generation model/VAE/GAN/VAE+GAN☆20Jun 26, 2018Updated 7 years ago
- ☆24Mar 15, 2022Updated 3 years ago
- ☆24Oct 9, 2018Updated 7 years ago
- These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB☆50Apr 9, 2019Updated 6 years ago