erasedwalt / CTC-ASR

An implementation of Jasper, QuartzNet, and Citrinet, plus a pipeline for training CTC-based ASR models

☆11

Alternatives and similar repositories for CTC-ASR:

Users who are interested in CTC-ASR compare it to the repositories listed below

TeaPoly / CTC-OptimizedLoss
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
☆57 Updated last year
DemisEom / RNNT-pytorch
Implementation of an RNN transducer
☆22 Updated 5 years ago
jingyonghou / KWS_Max-pooling_RHE
Mining effective negative training samples for keyword spotting (PyTorch)
☆59 Updated 4 years ago
Ephrem-ETH / E2E-KWS
End-to-End Keyword Spotting (E2E-KWS) using a character-level LSTM
☆39 Updated 2 years ago
juanmc2005 / SpeakerEmbeddingLossComparison
Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…
☆59 Updated 4 years ago
csukuangfj / optimized_transducer
Memory efficient transducer loss computation
☆68 Updated 2 years ago
jiay7 / wenet_onlinedecode
WeNet online decode demo
☆29 Updated 3 years ago
hbredin / DomainAdversarialVoiceActivityDetection
Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"
☆23 Updated 5 years ago
TeaPoly / Conformer-Athena
Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.
☆44 Updated 2 years ago
Mashiro009 / wenet-online-decoder-onnx
☆37 Updated 3 years ago
athena-team / athena-decoder
☆76 Updated 3 years ago
lenovo-voice / THE-2020-PERSONALIZED-VOICE-TRIGGER-CHALLENGE-BASELINE-SYSTEM
☆50 Updated 4 years ago
TowerYsable / ASR_awesome
Cutting-edge speech recognition papers
☆44 Updated 3 years ago
jsvir / vad
An official implementation of the ICASSP 2023 paper: SG-VAD: Stochastic Gates Based Speech Activity Detection
☆25 Updated 8 months ago
csukuangfj / transducer-loss-benchmarking
☆68 Updated 3 years ago
a-nagrani / VoxSRC2020
Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020
☆42 Updated 4 years ago
ZitengWang / python_kaldi_features
Python code to extract MFCC and FBANK speech features for Kaldi
☆65 Updated 6 years ago
Mashiro009 / wenet-onnx
☆31 Updated 3 years ago
MarkWuNLP / SemanticMask
This repo contains the code for "Semantic Mask for Transformer based End-to-End Speech Recognition"
☆38 Updated 4 years ago
oshindow / Transformer-Transducer
A pytorch_lightning reimplementation of the Transducer module from ESPnet.
☆76 Updated 4 years ago
k2-fsa / fast_rnnt
A torch implementation of a recursion which turns out to be useful for RNN-T.
☆141 Updated last year
cornerfarmer / ctc_segmentation
Segment a given audio into utterances using a trained end-to-end ASR model.
☆73 Updated 4 years ago
danFromTelAviv / key_words_spotting
Implementation of "Efficient Keyword Spotting Using Dilated Convolutions and Gating"
☆36 Updated 5 years ago
shakingWaves / LPCNet_torch
Torch version of LPCNet
☆20 Updated 4 years ago
skgusrb12 / voice_activity_detection
PyTorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)
☆26 Updated 4 years ago
R1ckShi / AESRC2020
[ICASSP2021] Data preparation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…
☆55 Updated 4 years ago
wenet-e2e / WeTextProcessing.deprecated
☆61 Updated 2 years ago
HaoranMiao / streaming-attention
Streaming attention networks for end-to-end automatic speech recognition
☆55 Updated 4 years ago
xiangxyq / minimize-chain-decoder
Minimize kaldi nnet3 chain decoder
☆45 Updated 5 years ago
snsun / kaldi-decoder-code-reading
☆31 Updated 2 years ago