Masao-Someki / ConformerLinks

Pytorch implementation of Conformer block.

☆21

Alternatives and similar repositories for Conformer

Users that are interested in Conformer are comparing it to the libraries listed below

Sorting:

archiki / Robust-E2E-ASR
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…
☆48Updated 6 months ago
KrishnaDN / Keyword-Transformer
Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"
☆23Updated 4 years ago
pika-online / AESRC2020
a deep accent recognition network
☆48Updated 3 years ago
biyoml / End-to-End-Mandarin-ASR
End-to-end speech recognition on AISHELL dataset.
☆32Updated 3 years ago
DemisEom / RNNT-pytorch
Implementaion RNN tranceducer
☆23Updated 6 years ago
jingyonghou / KWS_Max-pooling_RHE
Mining effective negative training samples for keyword spotting (PyTorch)
☆61Updated 5 years ago
janson9192 / autokws2021
☆13Updated 4 years ago
Ephrem-ETH / E2E-KWS
End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM
☆40Updated 2 years ago
oshindow / Transformer-Transducer
A pytorch_lightning reimplementation of the Transducer module from ESPnet.
☆77Updated 4 years ago
R1ckShi / AESRC2020
[ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…
☆55Updated 4 years ago
m-koichi / ConformerSED
☆30Updated 4 years ago
MihawkHu / DCASE2020_task1
Code for DCASE 2020 task 1a and task 1b.
☆87Updated 3 years ago
double22a / asr_nlp_paper_code
Papers of ASR, Tools of ASR
☆41Updated 5 months ago
RicherMans / PSL
Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"
☆30Updated 3 years ago
cageyoko / CTC-Attention-Mispronunciation
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
☆61Updated 4 years ago
foamliu / Speech-Transformer
PyTorch re-implementation of Speech-Transformer
☆101Updated 3 years ago
foamliu / Listen-Attend-Spell-v2
PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
☆38Updated 5 years ago
iariav / End-to-End-VAD
an Audio-Visual Voice Activity Detection using Deep Learning
☆49Updated 6 years ago
MingLunHan / CIF-PyTorch
[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…
☆73Updated 6 months ago
KimJeongSun / SpecAugment_numpy_scipy
fast SpecAugmentation code with numpy and scipy
☆31Updated 6 years ago
YUCHEN005 / DPSL-ASR
Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"
☆41Updated 2 years ago
SilvrDuck / AccentedSpeechRecognition
Experiments on speech recognition robustness to accents and dialects
☆12Updated 6 years ago
zengchang233 / Speaker_Verification_Tencent
Deep Discriminative Embeddings for Duration Robust Speaker Verification
☆19Updated 5 years ago
jiay7 / wenet_onlinedecode
Went online decode demo
☆30Updated 4 years ago
Kirili4ik / kws-attention-pytorch
Keyword spotting for audio with attention (KWS model for audio)
☆18Updated 4 years ago
echocatzh / torch-mfcc
A librosa STFT/Fbank/mfcc feature extration written up in PyTorch using 1D Convolutions.
☆77Updated 2 years ago
desh2608 / gmm-hmm-asr
Python implementation of simple GMM and HMM models for isolated digit recognition.
☆65Updated 4 years ago
danFromTelAviv / key_words_spotting
implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"
☆36Updated 5 years ago
mispchallenge / MISP2021-AVSR
repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"
☆17Updated 3 years ago
MarkWuNLP / SemanticMask
The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"
☆38Updated 5 years ago