YuanGongND / llm_speech_emotion_challenge
☆22Updated 7 months ago
Alternatives and similar repositories for llm_speech_emotion_challenge:
Users that are interested in llm_speech_emotion_challenge are comparing it to the libraries listed below
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆25Updated 4 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆52Updated 2 months ago
- Official release of StyleTalk dataset.☆60Updated 6 months ago
- ☆31Updated last year
- ☆19Updated last year
- 《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm☆81Updated last year
- EMO-SUPERB submission☆42Updated 4 months ago
- ☆25Updated 6 months ago
- ☆43Updated last year
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆39Updated last year
- ☆12Updated 10 months ago
- ☆48Updated 3 years ago
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM☆15Updated 2 months ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆33Updated last year
- DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning☆47Updated last year
- LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation …☆52Updated last month
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆11Updated 8 months ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆23Updated 5 months ago
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…☆43Updated last year
- ConMamba for Automatic Speech Recognition☆54Updated 5 months ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- BLSP-Emo: Towards Empathetic Large Speech-Language Models☆42Updated 7 months ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆41Updated 3 months ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆20Updated 4 months ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆22Updated 11 months ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆43Updated 2 months ago
- ARCH: Audio Representations benCHmark☆39Updated 5 months ago
- ☆36Updated 2 years ago