msalhab96 / RNN-Transducer

PyTorch implementation of the speech recognition paper "Sequence Transduction with Recurrent Neural Networks" (RNN-T)

☆12

Alternatives and similar repositories for RNN-Transducer

Users who are interested in RNN-Transducer are comparing it to the repositories listed below.

lorenlugosch / transducer-tutorial
Example code for a neural transducer model.
☆64 Updated last year
iiscleap / NISP-Dataset
☆30 Updated 2 years ago
archiki / Robust-E2E-ASR
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…
☆48 Updated 6 months ago
navana-tech / baseline_recipe_is21s_indic_asr_challenge
Multilingual and code-switching ASR challenges for low resource Indian languages.
☆21 Updated 3 years ago
ga642381 / Speech-Prompts-Adapters
This repository surveys papers on prompting and adapters for speech processing.
☆110 Updated last year
Srijith-rkr / KAUST-Whisper-Adapter
INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!
☆36 Updated last year
mayank-git-hub / ETE-Speech-Recognition
Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure Python and PyTorch
☆26 Updated 11 months ago
upskyy / Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…
☆107 Updated 3 years ago
pohanchi / AALBERT
The official repository for Audio ALBERT
☆66 Updated 3 years ago
michen00 / unified_multilingual_dataset_of_emotional_human_utterances
A unified dataset of multilingual emotional human utterances
☆26 Updated 3 years ago
BUTSpeechFIT / AMI-diarization-setup
☆54 Updated last year
burchim / EfficientConformer
[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
☆216 Updated 2 years ago
nttcslab-sp / EEND-vector-clustering
This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…
☆78 Updated 2 years ago
farisalasmary / wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆75 Updated 3 years ago
chimechallenge / C8DASR-Baseline-NeMo
NeMo: a toolkit for conversational AI
☆13 Updated last year
luferrer / ConfidenceIntervals
Confidence interval computation for evaluation in machine learning using the bootstrapping approach
☆86 Updated last year
archiki / ASR-Accent-Analysis
Analyzing and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.
☆15 Updated 5 years ago
billzyx / WavBERT
☆22 Updated last year
revdotcom / speech-datasets
Various speech datasets made available to the public
☆123 Updated 7 months ago
Xflick / EEND_PyTorch
A PyTorch implementation of End-to-End Neural Diarization
☆108 Updated 2 years ago
k2-fsa / fast_rnnt
A torch implementation of a recursion which turns out to be useful for RNN-T.
☆142 Updated last year
desh2608 / diarizer
Clustering-based methods for overlapping diarization
☆81 Updated last year
pyyush / SpecAugment
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆83 Updated 4 years ago
talhanai / wer-sigtest
Script to perform statistical significance test between ASR hypotheses.
☆22 Updated 7 years ago
NickRuiz / power-asr
Phonetically-Oriented Word Error Rate
☆35 Updated 6 years ago
declare-lab / speech-adapters
Code and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…
☆43 Updated 2 years ago
jindongwang / EasyEspnet
Making ESPnet easier to use
☆55 Updated 4 years ago
csukuangfj / transducer-loss-benchmarking
☆68 Updated 3 years ago
ankitapasad / layerwise-analysis
Layer-wise analysis of self-supervised pre-trained speech representations
☆108 Updated 9 months ago
m3hrdadfi / soxan
Wav2Vec for speech recognition, classification, and audio classification
☆265 Updated 3 years ago