EMOsuperb / EMO-SUPERB-submissionLinks
EMO-SUPERB submission
☆42Updated 9 months ago
Alternatives and similar repositories for EMO-SUPERB-submission
Users that are interested in EMO-SUPERB-submission are comparing it to the libraries listed below
Sorting:
- 《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm☆81Updated last year
- BLSP-Emo: Towards Empathetic Large Speech-Language Models☆45Updated 11 months ago
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆30Updated 3 months ago
- Official release of StyleTalk dataset.☆64Updated 11 months ago
- [TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation☆30Updated 8 months ago
- ☆43Updated 2 years ago
- ☆67Updated 8 months ago
- The open source code for LLM-Codec☆134Updated 9 months ago
- ☆71Updated last year
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆59Updated 7 months ago
- VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling☆78Updated 6 months ago
- ☆19Updated 2 years ago
- The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.☆133Updated last month
- Survey on speech generation work.☆19Updated last year
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…☆43Updated 2 years ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆91Updated 6 months ago
- UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts☆30Updated 5 months ago
- [NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words☆49Updated 11 months ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆39Updated 11 months ago
- A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024)☆51Updated last year
- PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models (…☆59Updated 11 months ago
- Audio-FLAN☆153Updated 2 months ago
- A large-scale speech corpus introduced in Spark-TTS, built from diverse open-source datasets for training text-to-speech (TTS) systems.☆73Updated last month
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆27Updated this week
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆137Updated 5 months ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆58Updated 7 months ago
- Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization☆179Updated 10 months ago
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆56Updated 11 months ago
- ☆21Updated 11 months ago
- Towards a general language-audio model for computational paralinguistic tasks☆12Updated 5 months ago