vshmyhlo / listen-attend-and-speell-pytorchLinks

Implementation of Automatic Speech Recognition inspired by "Listen, Attend and Spell" paper in PyTorch

☆11

Alternatives and similar repositories for listen-attend-and-speell-pytorch

Users that are interested in listen-attend-and-speell-pytorch are comparing it to the libraries listed below

Sorting:

m-wiesner / nnet_pytorch
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26Updated 11 months ago
MyrtleSoftware / myrtlespeech
Speech recognition
☆8Updated 4 years ago
vivianngo97 / Punctuation_Transcription
A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.
☆15Updated 4 years ago
cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Updated 5 years ago
qiujiali / lattice-rescore
☆16Updated 3 years ago
titu1994 / warprnnt_numba
WarpRNNT loss ported in Numba CPU/CUDA for Pytorch
☆17Updated 3 years ago
idiap / inv-tn
A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)
☆21Updated 7 years ago
jfainberg / lattice_combination
Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices
☆16Updated last year
craffel / mocha
Example implementation of Monotonic Chunkwise Attention.
☆52Updated 7 years ago
igormq / ctcdecode-pytorch
Python implementation of CTC beam search decoder + agnostic LM scorer
☆19Updated 4 years ago
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆31Updated 3 years ago
gpu-poor / gramvaani_hindi_asr
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆15Updated 3 years ago
RuABraun / texterrors
☆37Updated 2 months ago
artbataev / end2end
Losses and decoders for end-to-end ASR and OCR
☆34Updated 4 years ago
amazon-science / proteno
This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…
☆45Updated 4 years ago
robflynnyh / long-context-asr
Code for the paper: How Much Context Does My Attention-Based ASR System Need?
☆10Updated last month
EMRAI / emrai-synthetic-diarization-corpus
☆20Updated 6 years ago
hfutami / distill-bert-for-seq2seq-asr
☆24Updated 5 years ago
sigmorphon / 2020
SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…
☆36Updated 2 months ago
kate-egorova / ASR-hybrid-decoding
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Updated 5 years ago
awni / py-arpa-lm
Python API for reading and querying ARPA formatted language models.
☆33Updated 10 years ago
MiuLab / Lattice-Transformer-SLU
Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"
☆11Updated 4 years ago
markusdr / transducersaurus
Automatically exported from code.google.com/p/transducersaurus
☆11Updated 10 years ago
luomingshuang / k2-speechbrain
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Updated 3 years ago
SpeechColab / PySpeechColab
A library of speech gadgets.
☆13Updated 2 years ago
edufonseca / shift_sec
Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".
☆13Updated 2 years ago
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
☆12Updated 4 years ago
MiniXC / phones
A collection of utilities for handling IPA phones.
☆25Updated last year
revdotcom / words2num
Convert words to numbers
☆20Updated 3 years ago
BUTSpeechFIT / OOV-recovery-in-hybrid-ASR-system
☆9Updated 5 years ago