EMOsuperb / EMO-SUPERB-submission
EMO-SUPERB submission
☆42 Updated 5 months ago
Alternatives and similar repositories for EMO-SUPERB-submission:
Users who are interested in EMO-SUPERB-submission are comparing it to the repositories listed below.
- 《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm ☆81 Updated last year
- ☆63 Updated 5 months ago
- ☆43 Updated 2 years ago
- BLSP-Emo: Towards Empathetic Large Speech-Language Models ☆42 Updated 8 months ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition ☆32 Updated 7 months ago
- Multi-task speech classification of the accent and gender of an English speaker on Mozilla's Common Voice dataset ☆25 Updated 5 months ago
- [NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words ☆48 Updated 7 months ago
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition ☆21 Updated 10 months ago
- Official release of StyleTalk dataset. ☆61 Updated 7 months ago
- MSP-Podcast Challenge Baseline Code ☆20 Updated 8 months ago
- ☆19 Updated last year
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE) ☆39 Updated last year
- The open source code for LLM-Codec ☆126 Updated 6 months ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning ☆85 Updated 3 months ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks ☆39 Updated last year
- PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models (… ☆57 Updated 7 months ago
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification" ☆53 Updated 2 weeks ago
- ☆48 Updated 3 months ago
- A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024) ☆51 Updated 10 months ago
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024) ☆53 Updated 8 months ago
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und… ☆43 Updated last year
- ☆153 Updated 7 months ago
- ☆65 Updated last year
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey". ☆119 Updated last month
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" (ICLR 2024) ☆139 Updated last year
- This repository surveys papers focusing on prompting and adapters for speech processing. ☆107 Updated last year
- VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling ☆63 Updated 3 months ago
- [TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation ☆27 Updated 5 months ago
- MSP-Podcast Challenge Baseline Code for Interspeech 2025 ☆22 Updated 2 months ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer ☆47 Updated 3 months ago