kjw11 / CSEnet-ASRLinks
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆10Updated 2 months ago
Alternatives and similar repositories for CSEnet-ASR
Users that are interested in CSEnet-ASR are comparing it to the libraries listed below
Sorting:
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Updated 4 months ago
- ☆9Updated last year
- ☆10Updated 6 months ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆18Updated last week
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆3Updated 2 months ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- A neural speech codec based on discrete WavLM representations☆24Updated 9 months ago
- Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"☆10Updated 7 months ago
- Whisper Speech Quality Assessment (WhiSQA)☆9Updated 6 months ago
- ☆15Updated last year
- ☆16Updated last year