stevel705 / Tacotron-2-kerasLinks
Keras implementations of Tacotron-2
☆27Updated 4 years ago
Alternatives and similar repositories for Tacotron-2-keras
Users that are interested in Tacotron-2-keras are comparing it to the libraries listed below
Sorting:
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated 2 years ago
- ☆76Updated 3 years ago
- An implementation of RNN-Transducer loss in TF-2.0.☆45Updated 2 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆42Updated 3 years ago
- Grapheme To Phoneme☆73Updated last year
- Tensor2tensor experiment with SpecAugment☆46Updated 6 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated last year
- A fully convolution-network for speech-to-text, built on pytorch.☆126Updated 5 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Updated 4 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆90Updated 3 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆121Updated 6 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- PyTorch end-to-end speech recognition☆49Updated 4 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆53Updated 5 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- Multilingual Grapheme to Phoneme☆50Updated 9 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Updated 4 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆155Updated 5 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Updated 5 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆76Updated 3 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 5 years ago
- Text Independent Speaker Verification Using GE2E Loss☆84Updated 6 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆27Updated 4 years ago