Jiaxin-Ye / Emo-DNA
[ACM MM 2023] Official PyTorch implementation of "Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus Speech Emotion Recognition".
☆12Updated last year
Alternatives and similar repositories for Emo-DNA:
Users that are interested in Emo-DNA are comparing it to the libraries listed below
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition☆21Updated 10 months ago
- ☆22Updated 7 months ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆46Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆11Updated 7 months ago
- ☆19Updated last year
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12Updated 9 months ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Updated last year
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆25Updated 5 months ago
- EMO-SUPERB submission☆42Updated 5 months ago
- BLSP-Emo: Towards Empathetic Large Speech-Language Models☆42Updated 8 months ago
- ☆21Updated last year
- ☆34Updated 10 months ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Updated 2 years ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆47Updated 3 months ago
- PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models (…☆57Updated 7 months ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆32Updated 7 months ago
- Trustworthy Speech Emotion Recognition☆13Updated last year
- Pytorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS☆10Updated last year
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆30Updated last year
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆53Updated 3 months ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆22Updated last year
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆36Updated last year
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆33Updated last year
- Official release of StyleTalk dataset.☆61Updated 7 months ago
- MSP-Podcast Challenge Baseline Code☆20Updated 8 months ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆49Updated last year
- ☆12Updated 11 months ago
- Pytorch implementation for “V2C: Visual Voice Cloning”☆30Updated 2 years ago
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆53Updated 8 months ago