PoKoHA / ASR-ConformerLinks
Conformer: Convolution-augmented Transformer for Speech Recognition
☆15Updated 5 months ago
Alternatives and similar repositories for ASR-Conformer
Users that are interested in ASR-Conformer are comparing it to the libraries listed below
Sorting:
- Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)☆22Updated last year
- ☆156Updated 3 years ago
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆43Updated 2 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated 2 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆26Updated last year
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…☆25Updated 3 years ago
- ☆53Updated last year
- ☆32Updated 3 years ago
- [ICASSP'24] Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification☆16Updated last year
- A PyTorch implementation of End-to-End Neural Diarization☆109Updated 2 years ago
- ☆14Updated last year
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆40Updated 2 years ago
- ☆41Updated last year
- Official implementation of Transpotter, published in BMVC 2021☆16Updated 3 years ago
- ☆58Updated 10 months ago
- Official repository of NeXt-TDNN for speaker verification☆81Updated last year
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆53Updated last year
- ☆91Updated 9 months ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Updated 3 years ago
- ☆26Updated 3 years ago
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'☆92Updated 2 years ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Updated last year
- ☆59Updated last year
- ☆103Updated 4 years ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆124Updated last year
- ☆61Updated 2 years ago
- Pytorch implementation of conformer with with training script for end-to-end speech recognition on the LibriSpeech dataset.☆27Updated last year
- Build an attention-based model for speech recogntion.Use the Word2vec model to help to train the attention model.☆29Updated 6 years ago
- PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…☆112Updated 3 years ago
- [INTERSPEECH'24] Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection☆54Updated last year