flashlight / wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

☆6,436

Alternatives and similar repositories for wav2letter

Users who are interested in wav2letter are comparing it to the libraries listed below.

keithito / tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
☆2,981 Updated 2 years ago
buriburisuri / speech-to-text-wavenet
Speech-to-Text-WaveNet: End-to-end sentence-level English speech recognition based on DeepMind's WaveNet and TensorFlow
☆3,984 Updated 3 years ago
kaldi-asr / kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
☆15,029 Updated 2 weeks ago
SeanNaren / deepspeech.pytorch
Speech Recognition using DeepSpeech2.
☆2,129 Updated 2 years ago
google / uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…
☆1,582 Updated 10 months ago
mravanelli / pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…
☆2,389 Updated 3 years ago
NVIDIA / OpenSeq2Seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
☆1,561 Updated 4 years ago
r9y9 / deepvoice3_pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
☆1,979 Updated last year
zzw922cn / Automatic_Speech_Recognition
End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow
☆2,842 Updated 2 years ago
pannous / tensorflow-speech-recognition
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
☆2,172 Updated last year
syhw / wer_are_we
Attempt at tracking the state of the art and recent results (bibliography) on speech recognition.
☆1,869 Updated 3 years ago
Kyubyong / tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
☆1,832 Updated 3 years ago
NVIDIA / waveglow
A Flow-based Generative Network for Speech Synthesis
☆2,333 Updated last year
Rayhane-mamah / Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
☆2,309 Updated 2 years ago
espnet / espnet
End-to-End Speech Processing Toolkit
☆9,365 Updated this week
mozilla / DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras…
☆26,554 Updated last month
CSTR-Edinburgh / merlin
This is now the official location of the Merlin project.
☆1,316 Updated 5 years ago
tensorflow / lingvo
Lingvo
☆2,848 Updated last month
zzw922cn / awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synth…
☆3,056 Updated last year
facebookresearch / pytext
A natural language modeling framework based on PyTorch
☆6,326 Updated 2 years ago
fatchord / WaveRNN
WaveRNN Vocoder + TTS
☆2,164 Updated 3 years ago
NVIDIA / tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
☆5,257 Updated last year
andabi / deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
☆3,938 Updated 2 years ago
coqui-ai / STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
☆2,489 Updated last year
freewym / espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
☆941 Updated 11 months ago
r9y9 / wavenet_vocoder
WaveNet vocoder
☆2,359 Updated 2 years ago
cmusphinx / pocketsphinx
A small speech recognizer
☆4,166 Updated this week
alumae / kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.
☆1,086 Updated last year
pykaldi / pykaldi
A Python wrapper for Kaldi
☆1,022 Updated 6 months ago
resemble-ai / Resemblyzer
A python package to analyze and compare voices with deep learning
☆3,055 Updated last year