MrSupW / ICMC-ASR_Baseline
The baseline system for the ICASSP2024 ICMC-ASR Challenge.
☆46Updated 11 months ago
Related projects
Alternatives and complementary repositories for ICMC-ASR_Baseline
- ☆31Updated 3 years ago
- SpEx+(tied) source code☆75Updated last year
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS …☆93Updated last month
- A simple package for Guided source separation (GSS)☆107Updated 6 months ago
- Training data simulation☆42Updated 6 months ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆45Updated 2 months ago
- Production first, nn-based on-device signal processing toolkit.☆64Updated last year
- ☆26Updated 10 months ago
- ☆43Updated 3 years ago
- ☆29Updated 2 years ago
- This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).☆16Updated 3 weeks ago
- End-to-End Keyword Spotting (E2E-KWS) using a character-level LSTM☆39Updated 2 years ago
- ☆31Updated 2 years ago
- multi-scale time domain speaker extraction☆59Updated 3 years ago
- Code for calculating DNS_MOS.☆31Updated last year
- Code for synchronising all CHiME-5 audio signals for use in CHiME-6☆18Updated 4 years ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆65Updated 6 months ago
- ☆74Updated 3 months ago
- Conferencing Speech Challenge☆90Updated 3 years ago
- Official data preparation scripts for the URGENT 2024 Challenge☆66Updated 3 months ago
- ☆32Updated 2 years ago
- Target Speaker Extraction Toolkit☆112Updated 2 weeks ago
- ☆50Updated last year
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆14Updated 5 months ago
- This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which w…☆78Updated 2 years ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆23Updated 7 months ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆27Updated last year
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆29Updated last month
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023☆108Updated last year
- Source code for Consistent ensemble distillation for audio tagging☆16Updated 4 months ago