kjw11 / CSEnet-ASR
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆11Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for CSEnet-ASR
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆29Updated last month
- The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…☆11Updated last year
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆18Updated 2 months ago
- A toolkit dedicate for speech evaluation.☆18Updated last month
- ☆19Updated 2 months ago
- ☆15Updated 4 months ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆17Updated last month
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆20Updated 2 months ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆10Updated 11 months ago
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM☆15Updated 2 weeks ago
- ☆13Updated last year
- ☆14Updated last year
- ☆32Updated 2 months ago
- A small tool to calculate the distribution of audio durations in a directory☆14Updated last year
- ☆26Updated last year
- A neural speech codec based on discrete WavLM representations☆21Updated 2 months ago
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection☆11Updated this week
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆12Updated 2 years ago
- ☆17Updated last month
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆15Updated 2 years ago
- ClearVoice☆13Updated this week
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆18Updated 3 months ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆38Updated 2 months ago
- ☆12Updated 2 years ago
- Official code of ElasticAST (Interspeech 2024 paper)☆23Updated 3 months ago
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Updated 5 months ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆10Updated 10 months ago
- ☆15Updated 2 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆36Updated last month