LearnedVector/Wav2Letter

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LearnedVector/Wav2Letter)

LearnedVector / Wav2Letter

Speech Recognition model based off of FAIR research paper built using Pytorch.

☆87

Alternatives and similar repositories for Wav2Letter

Users that are interested in Wav2Letter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

louiskirsch / speechT
View on GitHub
An opensource speech-to-text software written in tensorflow
☆160Oct 15, 2022Updated 3 years ago
juliuskunze / speechless
View on GitHub
Speech-to-text based on wav2letter built for transfer learning
☆98Oct 21, 2022Updated 3 years ago
silversparro / wav2letter.pytorch
View on GitHub
A fully convolution-network for speech-to-text, built on pytorch.
☆126May 20, 2020Updated 6 years ago
SeanNaren / deepspeech.pytorch
View on GitHub
Speech Recognition using DeepSpeech2.
☆2,136Dec 13, 2022Updated 3 years ago
Sundy1219 / ctc_beam_search_lm
View on GitHub
CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统
☆49Jun 27, 2018Updated 8 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
juanmc2005 / torch-plda
View on GitHub
PyTorch implementation of PLDA as described in https://ravisoji.com/assets/papers/ioffe2006probabilistic.pdf
☆15Oct 16, 2020Updated 5 years ago
sunny8898 / DeepSpeech-tensorflow
View on GitHub
将百度DeepSpeech的keras后端由theano改为tensorflow，整合mozilla解码模块进行中文语音识别模型部署
☆10Dec 2, 2019Updated 6 years ago
bartwojcik / lossgrad
View on GitHub
Implementation of the LOSSGRAD optimization algorithm
☆15Mar 21, 2019Updated 7 years ago
lifelongeek / AAS_enhancement
View on GitHub
This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…
☆28Oct 10, 2019Updated 6 years ago
usc-sail / media-eval-2020
View on GitHub
MediaEval 2020: Music Mood Classification
☆18Mar 5, 2021Updated 5 years ago
HawkAaron / E2E-ASR
View on GitHub
PyTorch Implementations for End-to-End Automatic Speech Recognition
☆127Jun 10, 2019Updated 7 years ago
austinmoehle / wavernn
View on GitHub
WaveRNN-based waveform generator & demo of TensorFlow CuDNN-GRU usage.
☆24Aug 19, 2018Updated 7 years ago
Randl / MNASNet-pytorch
View on GitHub
MNASNet implementation and pre-trained model in PyTorch
☆10Mar 20, 2019Updated 7 years ago
ShankHarinath / DeepSpeech2-Keras
View on GitHub
DeepSpeech, Speech To Text, ASR, Speech recognition, Keras, Tensorflow
☆30Jan 16, 2018Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
DaoZhang0123 / compareCTCDecoder
View on GitHub
compare three CTC decoder, that is greedy decoder, beam decoder and prefix beam decoder
☆20Jul 10, 2018Updated 8 years ago
gentaiscool / end2end-asr-pytorch
View on GitHub
End-to-End Automatic Speech Recognition on PyTorch
☆304Jun 2, 2022Updated 4 years ago
Alexander-H-Liu / End-to-end-ASR-Pytorch
View on GitHub
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…
☆1,210Dec 19, 2020Updated 5 years ago
vincentqb / audio-tutorial
View on GitHub
Experiments and tutorials with and for torchaudio
☆13May 7, 2021Updated 5 years ago
edchengg / generative_model_speech
View on GitHub
Phone generation model/VAE/GAN/VAE+GAN
☆20Jun 26, 2018Updated 8 years ago
hirofumi0810 / neural_sp
View on GitHub
End-to-end ASR/LM implementation with PyTorch
☆594Aug 30, 2021Updated 4 years ago
nii-yamagishilab / TSNetVocoder
View on GitHub
☆42Oct 30, 2018Updated 7 years ago
fearofchou / mmnet
View on GitHub
☆16Apr 10, 2019Updated 7 years ago
Xiaoxiaohuangg / LAS-Chinese-pytorch
View on GitHub
Listen, Attend and Spell - PyTorch Implementation
☆17Dec 28, 2018Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
apoorvnandan / speech-recognition-primer
View on GitHub
This repository contains code for a tutorial on end to end automatic speech recognition.
☆18Sep 10, 2019Updated 6 years ago
athena-team / athena-decoder
View on GitHub
☆76Mar 18, 2022Updated 4 years ago
flashlight / wav2letter
View on GitHub
Facebook AI Research's Automatic Speech Recognition Toolkit
☆6,439Jul 14, 2026Updated last week
vivianngo97 / Punctuation_Transcription
View on GitHub
A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.
☆15Aug 6, 2020Updated 5 years ago
davidwarne / Bayesian_SIMD_examples
View on GitHub
Supplementary Material to accompany the paper, DJ Warne, SA Sisson, C Drovandi (2019) Acceleration of expensive computations in Bayesian…
☆13Oct 23, 2020Updated 5 years ago
vadimkantorov / inferspeech
View on GitHub
PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant
☆10Aug 12, 2019Updated 6 years ago
igormq / asr-study
View on GitHub
Implementation of all-neural speech recognition systems using Keras and Tensorflow
☆146Oct 12, 2017Updated 8 years ago
A-Jacobson / tacotron2
View on GitHub
pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf
☆42Mar 18, 2018Updated 8 years ago
Diamondfan / CTC_pytorch
View on GitHub
CTC end -to-end ASR for timit and 863 corpus.
☆219Dec 20, 2019Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
zw76859420 / ASR_Syllable
View on GitHub
基于卷积神经网络的语音识别声学模型的研究
☆181Jul 22, 2019Updated 7 years ago
DS3Lab / ner-at-first-sight
View on GitHub
This repository contains the data used for the paper "Entity Recognition at First Sight: Improving NER with Eye Movement Information" by …
☆12Jan 22, 2020Updated 6 years ago
Deepest-Project / Transformer-TTS
View on GitHub
Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"
☆64Jul 6, 2023Updated 3 years ago
awni / transducer
View on GitHub
A Fast Sequence Transducer Implementation with PyTorch Bindings
☆200Sep 20, 2022Updated 3 years ago
githubharald / CTCDecoder
View on GitHub
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing…
☆837Jan 31, 2026Updated 5 months ago
githubharald / CTCWordBeamSearch
View on GitHub
Connectionist Temporal Classification (CTC) decoder with dictionary and language model.
☆578Jan 31, 2026Updated 5 months ago
robmsmt / ASR-Audio-Data-Links
View on GitHub
A list of publically available audio data that anyone can download for ASR or other speech activities
☆237Aug 6, 2021Updated 4 years ago