NKU-HLT / PB-DSRLinks
[Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation
☆12Updated last year
Alternatives and similar repositories for PB-DSR
Users that are interested in PB-DSR are comparing it to the libraries listed below
Sorting:
- public child-adult speaker diarization/classification model and codes☆18Updated 9 months ago
- ☆11Updated last year
- UT-Sarulab MOS prediction system using SSL models☆294Updated last year
- ☆176Updated last year
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆41Updated 2 years ago
- A collection of dataset consists of a total of 8 English speech datasets for SER☆30Updated last year
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆234Updated last year
- ☆14Updated 3 months ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆183Updated 5 months ago
- The official repository of Dynamic-SUPERB.☆197Updated 7 months ago
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…☆43Updated 2 years ago
- ☆91Updated 9 months ago
- ☆12Updated 9 months ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆123Updated last year
- ☆156Updated 3 years ago
- SpeechFake: A Large-Scale Multilingual Speech Deepfake Dataset Incorporating Cutting-Edge Generation Methods☆22Updated 5 months ago
- Script to perform statistical significance test between ASR hypotheses.☆22Updated 8 years ago
- UTokyo-SaruLab MOS Prediction System☆290Updated last month
- A Survey of Spoken Dialogue Models (60 pages)☆314Updated last year
- ☆121Updated 3 years ago
- The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.☆179Updated 9 months ago
- Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition☆15Updated last year
- official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification☆14Updated 10 months ago
- ☆59Updated last year
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey".☆205Updated last week
- [TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation☆31Updated last year
- This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.☆112Updated 2 years ago
- EMO-SUPERB submission☆50Updated 3 months ago
- Retrieval-Augmented MOS Prediction with Prior Knowledge Integration☆32Updated 10 months ago
- ☆20Updated last year