andi611 / Conditional-SpecGAN-Tensorflow
Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network
☆10 · Updated 6 years ago
Alternatives and similar repositories for Conditional-SpecGAN-Tensorflow
Users who are interested in Conditional-SpecGAN-Tensorflow are comparing it to the repositories listed below.
- Official PyTorch implementation of TTS Style Transfer ☆23 · Updated 3 years ago
- Voice Conversion using Tacotron. ☆11 · Updated 2 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py… ☆33 · Updated 7 years ago
- An extensible speech synthesis system, built with PyTorch; the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery ☆26 · Updated 6 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility. ☆32 · Updated last year
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis By Conditioning Waven… ☆52 · Updated 6 years ago
- ☆31 · Updated 6 years ago
- Implementation of MelNet in PyTorch to generate high-fidelity audio samples ☆24 · Updated 4 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201… ☆15 · Updated 4 years ago
- Single Pass Spectrogram Inversion in a Jupyter Python notebook ☆34 · Updated 7 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences. ☆15 · Updated 4 years ago
- Data processing tools for preparing speech and labels for training TTS voices ☆27 · Updated 4 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity ☆12 · Updated 5 years ago
- ☆21 · Updated 7 years ago
- WaveNet vocoder using TensorFlow ☆26 · Updated 7 years ago
- SoundNet, built in Keras with pre-trained 8-layer model. ☆29 · Updated 5 years ago
- PyTorch implementation of lyre.ai's char2wav model ☆32 · Updated 8 years ago
- ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for… ☆43 · Updated 4 years ago
- Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information ☆27 · Updated 7 years ago
- PyTorch implementation of DeepMind's WaveRNN model ☆13 · Updated 5 years ago
- A neural network, written in TensorFlow, for filtering a target speaker's voice from audio ☆21 · Updated 7 years ago
- 24-hour Automatic Speech Recognition ☆27 · Updated 4 years ago
- PyTorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model. ☆29 · Updated 6 years ago
- Real-time MelGAN running on CPU ☆13 · Updated 5 years ago
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis" ☆16 · Updated 7 years ago
- TensorFlow implementation of WaveGlow ☆37 · Updated 5 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder ☆64 · Updated 6 years ago
- An implementation of Tacotron2 (excluding WaveNet-vocoder) in TensorFlow. ☆18 · Updated 7 years ago
- Sound examples for the Neural Parametric Singing Synthesizer (NPSS) ☆22 · Updated 3 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an… ☆46 · Updated 6 years ago