PoKoHA / ASR-Conformer
Conformer: Convolution-augmented Transformer for Speech Recognition
☆9Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for ASR-Conformer
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆38Updated last year
- ☆13Updated 4 months ago
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆34Updated 10 months ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆27Updated 4 months ago
- Pytorch implementation of conformer with with training script for end-to-end speech recognition on the LibriSpeech dataset.☆25Updated 6 months ago
- ☆32Updated 3 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆26Updated 3 months ago
- ☆31Updated 2 years ago
- Official implementation of the ICASSP 2024 paper: Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verificat…☆16Updated 7 months ago
- ☆45Updated last year
- Conformer encoder + Transformer decoder with Hybrid CTC/attention☆12Updated 3 years ago
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)☆54Updated 6 months ago
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…☆23Updated 2 years ago
- ☆26Updated 10 months ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆34Updated 7 months ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆44Updated 2 years ago
- 语音增强TFCN论文复现☆39Updated 2 years ago
- wsj0-{2, 3, 4, 5} mix generation scripts, in Python.☆51Updated 3 years ago
- Unofficial implementation of Dual-Path Transformer Network (DPTNet) for speech separation (Interspeech 2020)☆43Updated 3 years ago
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆16Updated 2 weeks ago
- ☆136Updated last year
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆28Updated last month
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆59Updated 2 years ago
- ☆86Updated 3 years ago
- Multilingual datasets with raw audio for speech emotion recognition☆19Updated 3 years ago
- ☆74Updated 2 months ago
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'☆87Updated last year
- ☆29Updated 2 years ago
- ☆14Updated 2 years ago
- ☆31Updated 3 years ago
- ☆105Updated 3 years ago