tongjinle123 / speech_recognition

end2end asr system with ctc + dynamic cnn transformer, well organized using custom template

☆7

Alternatives and similar repositories for speech_recognition:

Users that are interested in speech_recognition are comparing it to the libraries listed below

eastonYi / end-to-end_asr_pytorch
Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch
☆22Updated 4 years ago
R1ckShi / AESRC2020
[ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…
☆55Updated 4 years ago
JaesungBae / Speech-Command-Recognition-with-Capsule-Network
Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.
☆25Updated 5 years ago
Xiaoxiaohuangg / LAS-Chinese-pytorch
Listen, Attend and Spell - PyTorch Implementation
☆17Updated 6 years ago
biyoml / End-to-End-Mandarin-ASR
End-to-end speech recognition on AISHELL dataset.
☆30Updated 3 years ago
lightning830 / E2E-audio-speech-recognition
Conformer encoder + Transformer decoder with Hybrid CTC/attention
☆12Updated 3 years ago
Magic-Bubble / SpeechProcessForMachineLearning
用于机器学习的语音特征提取，包含FBank和MFCC等，原理讲解和step by step的实现
☆51Updated 5 years ago
jiay7 / wenet_onlinedecode
Went online decode demo
☆29Updated 3 years ago
iariav / End-to-End-VAD
an Audio-Visual Voice Activity Detection using Deep Learning
☆48Updated 5 years ago
Ephrem-ETH / E2E-KWS
End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM
☆39Updated 2 years ago
LeeYongHyeok / DCM_vgg_transformer
Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…
☆12Updated 4 years ago
DemisEom / RNNT-pytorch
Implementaion RNN tranceducer
☆21Updated 5 years ago
jingyonghou / KWS_Max-pooling_RHE
Mining effective negative training samples for keyword spotting (PyTorch)
☆58Updated 4 years ago
yufan-aslp / AliMeeting
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…
☆117Updated 2 years ago
marc-moreaux / audioset_raw
Download and create a tfreader for the audioset dataset
☆16Updated 4 years ago
idiap / CNN_QbE_STD
Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"
☆32Updated 6 years ago
pika-online / AESRC2020
a deep accent recognition network
☆48Updated 3 years ago
zw76859420 / ASR_Phone
以音素建模构建NN-CTC声学模型
☆15Updated 5 years ago
oshindow / Transformer-Transducer
A pytorch_lightning reimplementation of the Transducer module from ESPnet.
☆75Updated 3 years ago
staplesinLA / denoising_DIHARD18
☆59Updated 4 years ago
fengxin-bupt / Application-of-Word2vec-in-Phoneme-Recognition
Build an attention-based model for speech recogntion.Use the Word2vec model to help to train the attention model.
☆29Updated 5 years ago
zengchang233 / GMM_baseline
未来杯语音赛道说话人识别的baseline
☆48Updated 5 years ago
mayank-git-hub / ETE-Speech-Recognition
Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch
☆25Updated 5 months ago
ZhengkunTian / Speech-Tranformer-Pytorch
Seq2Seq Speech Recognition with Transformer on Mandarin Chinese
☆116Updated 5 years ago
gemengtju / SpEx_Plus
SpEx+(tied) source code
☆77Updated last year
zzpDapeng / speech_data_augment
A summary of speech data augment algorithms
☆68Updated 4 years ago
zzpDapeng / Transformer-Transducer
A streamable speech recognition model with transformer encoders and RNN-T loss
☆11Updated 3 years ago
liyongze / lstm_speaker_verification
☆35Updated 5 years ago
foamliu / Speech-Transformer
PyTorch re-implementation of Speech-Transformer
☆100Updated 3 years ago