NKU-HLT / PB-DSRLinks
[Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation
☆12Updated last year
Alternatives and similar repositories for PB-DSR
Users that are interested in PB-DSR are comparing it to the libraries listed below
Sorting:
- The official repository of Dynamic-SUPERB.☆198Updated 6 months ago
- public child-adult speaker diarization/classification model and codes☆17Updated 8 months ago
- UT-Sarulab MOS prediction system using SSL models☆291Updated last year
- ☆91Updated 8 months ago
- ☆11Updated last year
- Layer-wise analysis of self-supervised pre-trained speech representations☆122Updated last year
- A collection of dataset consists of a total of 8 English speech datasets for SER☆30Updated last year
- Script to perform statistical significance test between ASR hypotheses.☆22Updated 8 years ago
- ☆14Updated 3 months ago
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆40Updated 2 years ago
- ☆12Updated 8 months ago
- ☆176Updated last year
- SpeechFake: A Large-Scale Multilingual Speech Deepfake Dataset Incorporating Cutting-Edge Generation Methods☆20Updated 4 months ago
- Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition☆15Updated last year
- [TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation☆31Updated last year
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆180Updated 4 months ago
- Reference-aware automatic speech evaluation toolkit☆176Updated last year
- ☆121Updated 3 years ago
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆230Updated last year
- Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"☆119Updated 5 months ago
- This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.☆112Updated 2 years ago
- ☆156Updated 3 years ago
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…☆43Updated 2 years ago
- Mel cepstral distortion (MCD) computations in python.☆229Updated 8 years ago
- UTokyo-SaruLab MOS Prediction System☆283Updated 3 weeks ago
- Retrieval-Augmented MOS Prediction with Prior Knowledge Integration☆32Updated 9 months ago
- The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.☆177Updated 8 months ago
- ☆45Updated 2 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆77Updated 3 years ago
- ☆62Updated last year