buriburisuri / speech-to-text-wavenet
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow
β3,954Updated 3 years ago
Related projects β
Alternatives and complementary repositories for speech-to-text-wavenet
- πSpeech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networksβ2,166Updated 10 months ago
- End-to-end Automatic Speech Recognition for Madarian and English in Tensorflowβ2,845Updated last year
- A TensorFlow implementation of DeepMind's WaveNet paperβ5,415Updated last year
- A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Modelβ1,828Updated 2 years ago
- Facebook AI Research's Automatic Speech Recognition Toolkitβ6,390Updated 3 months ago
- Speedy Wavenet generation using dynamic programmingβ1,763Updated 7 years ago
- Deep neural networks for voice conversion (voice style transfer) in Tensorflowβ3,923Updated 2 years ago
- Speech Recognition using DeepSpeech2.β2,104Updated last year
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)β2,957Updated last year
- Facebook AI Research Sequence-to-Sequence Toolkitβ3,743Updated 3 years ago
- WaveNet vocoderβ2,327Updated last year
- PyTorch implementation of convolutional neural networks-based text-to-speech synthesis modelsβ1,969Updated 11 months ago
- A method to generate speech across multiple speakersβ872Updated 5 years ago
- A general-purpose encoder-decoder framework for Tensorflowβ5,605Updated 4 years ago
- Keras WaveNet implementationβ1,056Updated last year
- DeepMind's Tacotron-2 Tensorflow implementationβ2,276Updated last year
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Suβ¦β1,560Updated last month
- A Flow-based Generative Network for Speech Synthesisβ2,288Updated last year
- Open Source Neural Machine Translation in Torch (deprecated)β2,386Updated 4 years ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,β¦β2,367Updated 2 years ago
- A recurrent neural network for generating little stories about imagesβ2,961Updated 7 years ago
- This is now the official location of the Merlin project.β1,308Updated 4 years ago
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.β1,859Updated 2 years ago
- A PyTorch Implementation of End-to-End Models for Speech-to-Textβ754Updated last year
- This library provides common speech features for ASR including MFCCs and filterbank energies.β2,376Updated 3 years ago
- Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthβ¦β2,982Updated last year
- The official repository of the Eesen projectβ825Updated 5 years ago
- kaldi-asr/kaldi is the official location of the Kaldi project.β14,298Updated last month
- SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/β880Updated 3 years ago