C3Imaging / whisper_child_asrLinks
☆10Updated 2 years ago
Alternatives and similar repositories for whisper_child_asr
Users that are interested in whisper_child_asr are comparing it to the libraries listed below
Sorting:
- public child-adult speaker diarization/classification model and codes☆17Updated 7 months ago
- Some comprehensive papers about speaker diarization☆322Updated 6 months ago
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆683Updated 11 months ago
- A collection of dataset consists of a total of 8 English speech datasets for SER☆30Updated 11 months ago
- UT-Sarulab MOS prediction system using SSL models☆285Updated last year
- Variational Bayes HMM over x-vectors diarization☆280Updated last year
- Update ASR paper everyday☆406Updated this week
- The official repository of Dynamic-SUPERB.☆197Updated 5 months ago
- The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"☆183Updated 2 months ago
- Versatile Evaluation of Speech and Audio☆364Updated last week
- ☆91Updated 7 months ago
- Diarization scoring tools.☆260Updated 2 years ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆241Updated last week
- Layer-wise analysis of self-supervised pre-trained speech representations☆120Updated last year
- A PyTorch implementation of End-to-End Neural Diarization☆109Updated 2 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆77Updated 3 years ago
- [IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer☆210Updated 3 weeks ago
- Script to perform statistical significance test between ASR hypotheses.☆22Updated 8 years ago
- UTokyo-SaruLab MOS Prediction System☆271Updated last week
- Audio Captioning datasets for PyTorch.☆125Updated 5 months ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆178Updated 3 months ago
- Official repository of SepReformer for speech separation☆233Updated 11 months ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆390Updated last year
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆159Updated last week
- ☆58Updated 8 months ago
- End-to-End Neural Diarization☆418Updated 4 years ago
- ☆176Updated last year
- ☆21Updated last year
- Multilingual datasets with raw audio for speech emotion recognition☆30Updated 4 years ago
- Toolkit for downloading and processing Google's AudioSet dataset.☆173Updated 3 months ago