C3Imaging / whisper_child_asrLinks
☆10Updated 2 years ago
Alternatives and similar repositories for whisper_child_asr
Users that are interested in whisper_child_asr are comparing it to the libraries listed below
Sorting:
- public child-adult speaker diarization/classification model and codes☆16Updated 7 months ago
- A collection of dataset consists of a total of 8 English speech datasets for SER☆30Updated 10 months ago
- ☆90Updated 7 months ago
- Variational Bayes HMM over x-vectors diarization☆277Updated last year
- Some comprehensive papers about speaker diarization☆320Updated 6 months ago
- UT-Sarulab MOS prediction system using SSL models☆282Updated last year
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆76Updated 3 years ago
- The official repository of Dynamic-SUPERB.☆196Updated 5 months ago
- Sound Source Localization for AI Grand Challenge 2021☆22Updated 3 years ago
- Multilingual datasets with raw audio for speech emotion recognition☆30Updated 4 years ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆233Updated 2 weeks ago
- A PyTorch implementation of End-to-End Neural Diarization☆108Updated 2 years ago
- Versatile Evaluation of Speech and Audio☆360Updated last month
- Official repository of SepReformer for speech separation☆231Updated 10 months ago
- Update ASR paper everyday☆382Updated this week
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆683Updated 11 months ago
- UTokyo-SaruLab MOS Prediction System☆262Updated last month
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆157Updated last week
- CHiME-9 Task 1 - MCoRec baseline☆24Updated 4 months ago
- ☆58Updated 8 months ago
- Audio Captioning datasets for PyTorch.☆124Updated 4 months ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆120Updated last year
- The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"☆182Updated 2 months ago
- NeMo: a toolkit for conversational AI☆13Updated last year
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆177Updated 3 months ago
- End-to-End Neural Diarization☆415Updated 4 years ago
- ☆173Updated last year
- Diarization scoring tools.☆259Updated 2 years ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆83Updated 5 months ago
- Official repository of NeXt-TDNN for speaker verification☆79Updated last year