buriburisuri / speech-to-text-wavenet
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow
β3,965Updated 3 years ago
Alternatives and similar repositories for speech-to-text-wavenet:
Users that are interested in speech-to-text-wavenet are comparing it to the libraries listed below
- πSpeech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networksβ2,163Updated last year
- End-to-end Automatic Speech Recognition for Madarian and English in Tensorflowβ2,841Updated last year
- A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Modelβ1,827Updated 3 years ago
- A TensorFlow implementation of DeepMind's WaveNet paperβ5,426Updated last year
- Facebook AI Research's Automatic Speech Recognition Toolkitβ6,401Updated last month
- Speedy Wavenet generation using dynamic programmingβ1,762Updated 7 years ago
- Deep neural networks for voice conversion (voice style transfer) in Tensorflowβ3,929Updated 2 years ago
- PyTorch implementation of convolutional neural networks-based text-to-speech synthesis modelsβ1,972Updated last year
- This is now the official location of the Merlin project.β1,307Updated 4 years ago
- A Flow-based Generative Network for Speech Synthesisβ2,300Updated last year
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)β2,965Updated last year
- WaveNet vocoderβ2,337Updated last year
- DeepMind's Tacotron-2 Tensorflow implementationβ2,289Updated last year
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.β1,862Updated 2 years ago
- Speech Recognition using DeepSpeech2.β2,116Updated 2 years ago
- Facebook AI Research Sequence-to-Sequence Toolkitβ3,742Updated 3 years ago
- Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthβ¦β3,007Updated last year
- Deep learning library featuring a higher-level API for TensorFlow.β9,615Updated 8 months ago
- A method to generate speech across multiple speakersβ872Updated 5 years ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,β¦β2,374Updated 2 years ago
- Keras WaveNet implementationβ1,054Updated last year
- The official repository of the Eesen projectβ825Updated 5 years ago
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Rasβ¦β25,606Updated 4 months ago
- Interactive Image Generation via Generative Adversarial Networksβ3,981Updated 4 years ago
- A general-purpose encoder-decoder framework for Tensorflowβ5,606Updated 4 years ago
- β488Updated 7 years ago
- kaldi-asr/kaldi is the official location of the Kaldi project.β14,445Updated last month
- This library provides common speech features for ASR including MFCCs and filterbank energies.β2,382Updated 3 years ago
- Tensorflow Implementation of Deep Voice 3β453Updated 6 years ago
- End-to-End Speech Processing Toolkitβ8,686Updated this week