sooftware / End-to-End-Speech-Recognition-Models
PyTorch implementation of automatic speech recognition models.
☆38Updated 4 years ago
Alternatives and similar repositories for End-to-End-Speech-Recognition-Models:
Users that are interested in End-to-End-Speech-Recognition-Models are comparing it to the libraries listed below
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆66Updated 3 years ago
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆34Updated 2 years ago
- PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).☆38Updated 5 years ago
- Multi-Head-Attention RNN pytorch implement for keyword spotting☆21Updated 4 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆45Updated 3 years ago
- Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.☆34Updated 3 years ago
- fast SpecAugmentation code with numpy and scipy☆30Updated 5 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆47Updated last month
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Updated 2 years ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Updated 3 years ago
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆26Updated 3 years ago
- PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…☆102Updated 2 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆57Updated last year
- Implementaion RNN tranceducer☆21Updated 5 years ago
- Conformer encoder + Transformer decoder with Hybrid CTC/attention☆12Updated 3 years ago
- A Fairseq implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.☆11Updated 4 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆43Updated 3 years ago
- Example code for a neural transducer model.☆61Updated last year
- An implementation of RNN-Transducer loss in TF-2.0.☆45Updated last year
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆55Updated 4 years ago
- Tensorflow 2 Speech Recognition Code (Transformer)☆25Updated 4 years ago
- End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.☆10Updated 3 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆25Updated 6 months ago
- Making Espnet easier to use☆54Updated 3 years ago
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23Updated 3 years ago
- 다양한 feature와 deep learning을 이용한 Phoneme Recognition입니다.☆13Updated 5 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- A pytorch_lightning reimplementation of the Transducer module from ESPnet.☆76Updated 3 years ago
- End-to-end speech recognition on AISHELL dataset.☆31Updated 3 years ago
- VoxSRC Challenge☆31Updated 5 years ago