sooftware / RNN-TransducerLinks

PyTorch implementation of RNN-Transducer(RNN-T).

☆78

Alternatives and similar repositories for RNN-Transducer

Users that are interested in RNN-Transducer are comparing it to the libraries listed below

Sorting:

sooftware / speech-transformer
Transformer implementation speciaized in speech recognition tasks using Pytorch.
☆64Updated 3 years ago
lorenlugosch / transducer-tutorial
Example code for a neural transducer model.
☆65Updated last year
sooftware / lightning-asr
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
☆46Updated 4 years ago
upskyy / ContextNet
PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…
☆38Updated 3 years ago
upskyy / Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…
☆108Updated 3 years ago
farisalasmary / wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆75Updated 3 years ago
TeaPoly / Conformer-Athena
Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.
☆44Updated 2 years ago
ynop / py-ctc-decode
CTC Decoder implementation with python only. Also supports language model decoding using KenLM.
☆37Updated last year
cornerfarmer / ctc_segmentation
Segment a given audio into utterances using a trained end-to-end ASR model.
☆73Updated 4 years ago
k2-fsa / fast_rnnt
A torch implementation of a recursion which turns out to be useful for RNN-T.
☆143Updated last year
iamjanvijay / rnnt
An implementation of RNN-Transducer loss in TF-2.0.
☆45Updated 2 years ago
JoungheeKim / Non-Attentive-Tacotron
This is Pytorch Implementation of Google's Non-attentive Tacotron.
☆57Updated 2 years ago
zldzmfoq12 / VCtube
A pakage for crawling audio from Youtube
☆42Updated last year
upskyy / Squeezeformer
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)
☆143Updated 2 years ago
tts-tutorial / interspeech2022
☆163Updated 2 years ago
burchim / EfficientConformer
[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
☆216Updated 2 years ago
idiap / contextual-biasing-on-gpus
Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…
☆20Updated last year
asappresearch / wav2seq
Official code for Wav2Seq
☆95Updated 3 years ago
speech-paper-reading / speech-paper-reading
Repository for speech paper reading
☆33Updated 3 years ago
espnet / notebook
☆69Updated last month
RuABraun / texterrors
☆37Updated 3 months ago
TeaPoly / CTC-OptimizedLoss
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
☆58Updated last year
sooftware / openspeech
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
☆35Updated 3 years ago
pzelasko / kaldialign
Python wrappers for Kaldi Levenshtein's distance and alignment code.
☆67Updated 2 months ago
lingjzhu / clap-ipa
Keyword spotting and forced alignment in any language
☆63Updated 3 weeks ago
oshindow / Transformer-Transducer
A pytorch_lightning reimplementation of the Transducer module from ESPnet.
☆77Updated 4 years ago
1ytic / warp-rna
Recurrent Neural Aligner
☆50Updated 5 years ago
ldong1111 / GraphemeBERT
This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models
☆46Updated 3 years ago
asappresearch / multistream-cnn
Multistream CNN for Robust Acoustic Modeling
☆40Updated 4 years ago
tango4j / Auto-Tuning-Spectral-Clustering
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
☆121Updated 3 years ago