Speech Recognition model based off of FAIR research paper built using Pytorch.
☆87Dec 11, 2018Updated 7 years ago
Alternatives and similar repositories for Wav2Letter
Users that are interested in Wav2Letter are comparing it to the libraries listed below
Sorting:
- An opensource speech-to-text software written in tensorflow☆160Oct 15, 2022Updated 3 years ago
- Implementation of the LOSSGRAD optimization algorithm☆15Mar 21, 2019Updated 6 years ago
- A fully convolution-network for speech-to-text, built on pytorch.☆126May 20, 2020Updated 5 years ago
- Speech-to-text based on wav2letter built for transfer learning☆98Oct 21, 2022Updated 3 years ago
- Speech Recognition using DeepSpeech2.☆2,139Dec 13, 2022Updated 3 years ago
- MediaEval 2020: Music Mood Classification☆18Mar 5, 2021Updated 4 years ago
- ☆10Apr 8, 2024Updated last year
- Experiments and tutorials with and for torchaudio☆13May 7, 2021Updated 4 years ago
- pytorch implementation of "pix2face" network for 3D face estimation from 2D images☆12Jan 14, 2021Updated 5 years ago
- Image-source method for room acoustics☆14Feb 5, 2020Updated 6 years ago
- Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021)☆10May 1, 2025Updated 10 months ago
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.☆25Dec 11, 2025Updated 2 months ago
- MNASNet implementation and pre-trained model in PyTorch☆10Mar 20, 2019Updated 6 years ago
- Repository containing code for getting statistical guarantees on properties of BNNs☆13Apr 24, 2019Updated 6 years ago
- This repository contains the data used for the paper "Entity Recognition at First Sight: Improving NER with Eye Movement Information" by …☆11Jan 22, 2020Updated 6 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统☆48Jun 27, 2018Updated 7 years ago
- 将百度DeepSpeech的keras后端由theano改为tensorflow,整合mozilla解码模块进行中文语音识别模型部署☆10Dec 2, 2019Updated 6 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- End-to-End Automatic Speech Recognition on PyTorch☆304Jun 2, 2022Updated 3 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Oct 10, 2019Updated 6 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Aug 6, 2020Updated 5 years ago
- Minimal and Clean Reinforcement Learning Examples in PyTorch☆42Dec 25, 2018Updated 7 years ago
- This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…☆1,212Dec 19, 2020Updated 5 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆127Jun 10, 2019Updated 6 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆15Dec 22, 2022Updated 3 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Jul 6, 2023Updated 2 years ago
- ☆15Oct 29, 2019Updated 6 years ago
- Tacotron implementation of pytorch☆12Sep 3, 2017Updated 8 years ago
- DeepSpeech, Speech To Text, ASR, Speech recognition, Keras, Tensorflow☆30Jan 16, 2018Updated 8 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…☆63Oct 15, 2019Updated 6 years ago
- Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing…☆835Jan 31, 2026Updated last month
- This is the official repository of ``Scalable Neural Vocoder from Range-Null Space Decomposition'', which is submitted to TPAMI.☆35Oct 11, 2025Updated 4 months ago
- Codebase for the paper "Adversarial Attacks on Time Series"☆21Mar 1, 2019Updated 7 years ago
- WaveNet auto-ancoders for ZeroSpeech challenge 2020☆37Apr 7, 2022Updated 3 years ago
- Pytorch Implementation for "Preserving Linear Separability in Continual Learning by Backward Feature Projection" (CVPR 2023)☆18Jun 29, 2023Updated 2 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- 基于卷积神经网络的语音识别声学模型的研究☆181Jul 22, 2019Updated 6 years ago
- A temporal module for PyTorch-ComplexTensor☆44Jun 28, 2024Updated last year