Miamoto / Conformer-NTM
☆13Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Conformer-NTM
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆14Updated 2 months ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆15Updated 3 years ago
- Discriminative Training of VBx Diarization☆18Updated last month
- Exploring Binary Classification Loss for Speaker Verification☆14Updated last year
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆26Updated last year
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆14Updated 4 months ago
- ☆31Updated 2 years ago
- Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…☆20Updated 3 weeks ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆15Updated last month
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆16Updated last week
- acnn for text-independent speaker recognition☆9Updated 2 years ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆18Updated last month
- kaldi cnn-tdnnf baseline☆13Updated 3 years ago
- ☆31Updated 3 years ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆20Updated last month
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)