stevel705 / Tacotron-2-keras
Keras implementations of Tacotron-2
☆27Updated 3 years ago
Alternatives and similar repositories for Tacotron-2-keras:
Users that are interested in Tacotron-2-keras are comparing it to the libraries listed below
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- Tensor2tensor experiment with SpecAugment☆47Updated 5 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf☆31Updated 2 years ago
- Deep Convolution Text to Speech☆35Updated 6 years ago
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave☆20Updated last year
- End-to-End Speech Recognition Using Tensorflow☆42Updated last year
- Grapheme to phoneme model for PyTorch☆40Updated 2 years ago
- Implementation of Multi speaker TTS☆50Updated 4 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆51Updated 4 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆45Updated 4 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆26Updated 3 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 5 years ago
- An implementation of RNN-Transducer loss in TF-2.0.☆45Updated last year
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- PyTorch end-to-end speech recognition☆49Updated 4 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37Updated 3 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆70Updated 3 years ago
- Non-Parallel Voice Conversion with Cyclic Variational Autoencoder☆52Updated 4 years ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 5 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference☆30Updated 4 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated last year
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆72Updated 3 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 3 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago