msalhab96 / ConformerLinks

An implementation for "Conformer: Convolution-augmented Transformer for Speech Recognition" Paper

☆21

Alternatives and similar repositories for Conformer

Users that are interested in Conformer are comparing it to the libraries listed below

Sorting:

upskyy / Squeezeformer
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)
☆147Updated 3 years ago
Wadaboa / titanet
Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO
☆67Updated 3 years ago
khanld / ASR-Wav2vec-Finetune
Finetune Wa2vec 2.0 For Speech Recognition
☆142Updated 9 months ago
nii-yamagishilab / Attention_Backend_for_ASV
Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances
☆50Updated 3 years ago
HarunoriKawano / Wav2vec2.0
Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.
☆54Updated 2 years ago
pyyush / SpecAugment
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆90Updated 5 years ago
YuanGongND / psla
Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".
☆149Updated 2 years ago
msh9184 / contrastive-equilibrium-learning
☆21Updated 4 years ago
WangHelin1997 / MaskSpec
The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training
☆44Updated 11 months ago
sooftware / speech-transformer
Transformer implementation speciaized in speech recognition tasks using Pytorch.
☆64Updated 4 years ago
nikvaessen / w2v2-speaker
Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053
☆145Updated 3 years ago
YChenL / DS-TDNN
Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch
☆41Updated 2 years ago
YuanGongND / vocalsound
Dataset and baseline code for the VocalSound dataset (ICASSP2022).
☆154Updated 3 years ago
burchim / EfficientConformer
[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
☆220Updated 2 years ago
frednam93 / FilterAugSED
☆67Updated last year
lightning830 / E2E-audio-speech-recognition
Conformer encoder + Transformer decoder with Hybrid CTC/attention
☆12Updated 4 years ago
IDRnD / VoxTube
The VoxTube dataset official repository
☆71Updated last year
dmlguq456 / NeXt_TDNN_ASV
Official repository of NeXt-TDNN for speaker verification
☆79Updated last year
sasv-challenge / SASVC2022_Baseline
Baseline for the Spoofing-aware Speaker Verification Challenge 2022
☆65Updated 3 years ago
TaoRuijie / Loss-Gated-Learning
ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'
☆91Updated 2 years ago
vectominist / MiniASR
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆53Updated 2 years ago
joonson / voxceleb_unsupervised
Augmentation adversarial training for self-supervised speaker recognition
☆78Updated 4 years ago
mispchallenge / misp2022_baseline
☆31Updated 2 years ago
theolepage / sslsv
Toolkit for training and evaluating Self-Supervised Learning (SSL) frameworks for Speaker Verification (SV).
☆34Updated 4 months ago
upskyy / ContextNet
PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…
☆38Updated 3 years ago
hechmik / voxceleb_enrichment_age_gender
Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021
☆70Updated 3 years ago
khanld / Wav2vec2-Pretraining
Wav2vec 2.0 Self-Supervised Pretraining
☆56Updated 9 months ago
umbertocappellazzo / Llama-AVSR
Official Pytorch implementation of "Large Language Models are Strong Audio-Visual Speech Recognition Learners" [ICASSP 2025] and "Mitigat…
☆48Updated last week
jreremy / conformer
Pytorch implementation of conformer with with training script for end-to-end speech recognition on the LibriSpeech dataset.
☆26Updated last year
FrenchKrab / IS2023-powerset-diarization
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
☆91Updated 2 years ago