andi611 / Conditional-SpecGAN-TensorflowLinks
Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network
☆10Updated 7 years ago
Alternatives and similar repositories for Conditional-SpecGAN-Tensorflow
Users that are interested in Conditional-SpecGAN-Tensorflow are comparing it to the libraries listed below
Sorting:
- Implementation and reviews of Audio & Computer vision related papers in python using keras and tensorflow.☆40Updated 7 years ago
- Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.☆29Updated 6 years ago
- Implementation of MelNet in PyTorch to generate high-fidelity audio samples☆24Updated 5 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 7 years ago
- pytorch implementation of lyre.ai's char2wav model☆32Updated 8 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 7 years ago
- Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information☆28Updated 8 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Updated 5 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated 2 years ago
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 6 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 7 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 4 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆58Updated 6 years ago
- wavenet vocoder using tensorflow☆26Updated 7 years ago
- MelNet-Tensorflow implementation☆40Updated 5 years ago
- Extract frequency, power, width and dissonance of formants from wav files☆28Updated 3 years ago
- Voice Conversion using Tacotron.☆11Updated 3 years ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 6 years ago
- Pytorch code for the paper 'Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acousti…☆14Updated 5 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 6 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 4 years ago
- pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf☆43Updated 7 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 6 years ago
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave☆20Updated 2 years ago
- Aligns text (lyrics) with monophonic singing voice (audio). The algorithm uses structural segmentation to segment the audio into structur…☆93Updated 7 years ago
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 6 years ago
- DCASE2016 TASK1 Scene Classification☆12Updated 8 years ago
- Audio Classification using Image Classification☆48Updated 6 years ago
- A pytorch implementation of FFTNet.☆37Updated 7 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Updated 6 years ago