Speech Recognition model based off of FAIR research paper built using Pytorch.
☆87Dec 11, 2018Updated 7 years ago
Alternatives and similar repositories for Wav2Letter
Users that are interested in Wav2Letter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An opensource speech-to-text software written in tensorflow☆160Oct 15, 2022Updated 3 years ago
- Speech-to-text based on wav2letter built for transfer learning☆98Oct 21, 2022Updated 3 years ago
- A fully convolution-network for speech-to-text, built on pytorch.☆126May 20, 2020Updated 5 years ago
- Speech Recognition using DeepSpeech2.☆2,140Dec 13, 2022Updated 3 years ago
- CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统☆48Jun 27, 2018Updated 7 years ago
- PyTorch implementation of PLDA as described in https://ravisoji.com/assets/papers/ioffe2006probabilistic.pdf☆15Oct 16, 2020Updated 5 years ago
- 将百度DeepSpeech的keras后端由theano改为tensorflow,整合mozilla解码模块进行中文语音识别模型部署☆10Dec 2, 2019Updated 6 years ago
- Implementation of the LOSSGRAD optimization algorithm☆15Mar 21, 2019Updated 7 years ago
- MediaEval 2020: Music Mood Classification☆18Mar 5, 2021Updated 5 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Oct 10, 2019Updated 6 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆127Jun 10, 2019Updated 6 years ago
- pytorch implementation of "pix2face" network for 3D face estimation from 2D images☆12Jan 14, 2021Updated 5 years ago
- WaveRNN-based waveform generator & demo of TensorFlow CuDNN-GRU usage.☆24Aug 19, 2018Updated 7 years ago
- DeepSpeech, Speech To Text, ASR, Speech recognition, Keras, Tensorflow☆30Jan 16, 2018Updated 8 years ago
- Image-source method for room acoustics☆14Feb 5, 2020Updated 6 years ago
- compare three CTC decoder, that is greedy decoder, beam decoder and prefix beam decoder☆20Jul 10, 2018Updated 7 years ago
- End-to-End Automatic Speech Recognition on PyTorch☆304Jun 2, 2022Updated 3 years ago
- Phone generation model/VAE/GAN/VAE+GAN☆20Jun 26, 2018Updated 7 years ago
- This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…☆1,212Dec 19, 2020Updated 5 years ago
- Experiments and tutorials with and for torchaudio☆13May 7, 2021Updated 4 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- ☆42Oct 30, 2018Updated 7 years ago
- This repository contains code for a tutorial on end to end automatic speech recognition.☆17Sep 10, 2019Updated 6 years ago
- Listen, Attend and Spell - PyTorch Implementation☆17Dec 28, 2018Updated 7 years ago
- ☆76Mar 18, 2022Updated 4 years ago
- Facebook AI Research's Automatic Speech Recognition Toolkit☆6,445Jan 12, 2026Updated 2 months ago
- Supplementary Material to accompany the paper, DJ Warne, SA Sisson, C Drovandi (2019) Acceleration of expensive computations in Bayesian…☆13Oct 23, 2020Updated 5 years ago
- Implementation of all-neural speech recognition systems using Keras and Tensorflow☆145Oct 12, 2017Updated 8 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- ☆16Apr 10, 2019Updated 6 years ago
- CTC end -to-end ASR for timit and 863 corpus.☆219Dec 20, 2019Updated 6 years ago
- pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf☆43Mar 18, 2018Updated 8 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆231Aug 6, 2021Updated 4 years ago
- This repository contains the data used for the paper "Entity Recognition at First Sight: Improving NER with Eye Movement Information" by …☆11Jan 22, 2020Updated 6 years ago
- 基于卷积神经网络的语音识别声学模型的研究☆181Jul 22, 2019Updated 6 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Jul 6, 2023Updated 2 years ago
- A Fast Sequence Transducer Implementation with PyTorch Bindings☆200Sep 20, 2022Updated 3 years ago
- A Streamlit app to add structured tags to a dataset card☆22Jun 30, 2022Updated 3 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Aug 6, 2020Updated 5 years ago