PoKoHA / ASR-ConformerLinks
Conformer: Convolution-augmented Transformer for Speech Recognition
☆15Updated 5 months ago
Alternatives and similar repositories for ASR-Conformer
Users that are interested in ASR-Conformer are comparing it to the libraries listed below
Sorting:
- ☆157Updated 3 years ago
- ☆103Updated 4 years ago
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆43Updated 2 years ago
- This repository contains official pytorch implementation and pre-trained models for the MR-RawNet.☆17Updated last year
- Official repository of NeXt-TDNN for speaker verification☆81Updated last year
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆26Updated last year
- A PyTorch implementation of End-to-End Neural Diarization☆109Updated 2 years ago
- ☆41Updated last year
- [ICASSP'24] Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification☆16Updated last year
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆248Updated last month
- ☆59Updated last year
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated 2 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆78Updated 3 years ago
- ☆54Updated last year
- ☆61Updated 2 years ago
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Updated 2 years ago
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…☆25Updated 3 years ago
- Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch☆106Updated 5 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆96Updated 4 years ago
- ☆58Updated 10 months ago
- ☆32Updated 3 years ago
- ☆12Updated 3 years ago
- Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)☆22Updated last year
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'☆92Updated 2 years ago
- ☆26Updated 3 years ago
- ☆91Updated 9 months ago
- [Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection☆37Updated 10 months ago
- [INTERSPEECH'24] Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection☆54Updated last year
- ☆31Updated 3 years ago
- This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Spea…☆71Updated 2 years ago