juice500ml / dysarthria-gop
☆20Updated 3 months ago
Related projects: ⓘ
- Script to perform statistical significance test between ASR hypotheses.☆19Updated 7 years ago
- ☆39Updated last year
- ☆47Updated 4 months ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆12Updated 3 months ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆40Updated this week
- A list of papers for child ASR☆24Updated 5 months ago
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…☆40Updated last year
- Official repository of NeXt-TDNN for speaker verification☆48Updated 5 months ago
- ☆48Updated 11 months ago
- Keyword spotting and forced alignment in any language☆31Updated 2 months ago
- Clustering-based methods for overlapping diarization☆68Updated 8 months ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆26Updated last year
- Error correction back-end for speaker diarization☆9Updated 11 months ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆65Updated 5 months ago
- Phoneme segmentation using pre-trained speech models☆49Updated last year
- ☆69Updated this week
- ☆70Updated last month
- Layer-wise analysis of self-supervised pre-trained speech representations☆88Updated last month
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆31Updated last year
- ☆14Updated 2 years ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆32Updated 11 months ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆42Updated this week
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆31Updated last year
- Discriminative Training of VBx Diarization☆17Updated 7 months ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆78Updated 11 months ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆18Updated last year
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆37Updated 3 months ago
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆18Updated 11 months ago
- Reference-aware automatic speech evaluation toolkit☆95Updated 6 months ago
- ☆31Updated 3 years ago