andi611 / Conditional-SpecGAN-Tensorflow
Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network
☆10Updated 6 years ago
Alternatives and similar repositories for Conditional-SpecGAN-Tensorflow
Users that are interested in Conditional-SpecGAN-Tensorflow are comparing it to the libraries listed below
- https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques …☆26Updated 8 years ago
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 5 years ago
- Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information☆28Updated 8 years ago
- Implementation and reviews of Audio & Computer vision related papers in python using keras and tensorflow.☆40Updated 6 years ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 6 years ago
- A neural network, written in TensorFlow, for filtering a target speaker's voice from audio☆21Updated 7 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 7 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- Python implementation of the "Shazam" algorithm☆52Updated 6 years ago
- Implementation of MelNet in PyTorch to generate high-fidelity audio samples☆24Updated 5 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆50Updated 8 years ago
- Audio Classification - Multilayer Neural Networks using TensorFlow☆28Updated 8 years ago
- Voice Conversion using Tacotron.☆11Updated 2 years ago
- Enhancement of Audio Quality (Bit-Depth and Sampling-Rate) using Deep Learning.☆33Updated 5 years ago
- PyTorch code for the paper 'Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acousti…☆14Updated 4 years ago
- A PyTorch implementation for the ZeroSpeech 2019 challenge.☆112Updated 5 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆63Updated 4 years ago
- MelNet-Tensorflow implementation☆40Updated 4 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 7 years ago
- Lyrics-to-audio alignment system. Initially done using HTK for rapid prototyping☆14Updated 7 years ago
- 24-hour Automatic Speech Recognition☆27Updated 4 years ago
- A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable wo…☆69Updated 7 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 6 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the…☆47Updated 2 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition…☆98Updated 3 years ago
- A fast CNN-based vocoder☆78Updated 5 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated 2 years ago
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 6 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 4 years ago