abikaki / awesome-speech-emotion-recognitionView external linksLinks
π Awesome lists about Speech Emotion Recognition
β101Dec 24, 2024Updated last year
Alternatives and similar repositories for awesome-speech-emotion-recognition
Users that are interested in awesome-speech-emotion-recognition are comparing it to the libraries listed below
Sorting:
- This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.β21Dec 20, 2023Updated 2 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at Iβ¦β18Feb 17, 2023Updated 3 years ago
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)β69Jul 8, 2024Updated last year
- A collection of datasets for the purpose of emotion recognition/detection in speech.β399Sep 30, 2024Updated last year
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)β33Sep 29, 2023Updated 2 years ago
- Speaker overlap-aware Neural Diarizationβ12Feb 13, 2023Updated 3 years ago
- β13Nov 25, 2023Updated 2 years ago
- Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better uβ¦β26Apr 19, 2024Updated last year
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024β15Jun 11, 2024Updated last year
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognitionβ27Apr 11, 2024Updated last year
- Transformer-based model for Speech Emotion Recognition(SER) - implemented by Pytorchβ42Apr 12, 2024Updated last year
- Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognitionβ81Mar 12, 2024Updated last year
- [INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmarkβ309Mar 31, 2025Updated 10 months ago
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey".β208Feb 10, 2026Updated last week
- β70Sep 13, 2024Updated last year
- [ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training foβ¦β1,056Dec 23, 2024Updated last year
- This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.β47Apr 14, 2025Updated 10 months ago
- arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-dailyβ14Jan 6, 2025Updated last year
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. β¦β687Dec 25, 2024Updated last year
- [INTERSPEECH 2024] The official implementation of EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for β¦β171May 20, 2025Updated 8 months ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkitβ13Nov 18, 2022Updated 3 years ago
- Emotion recognition from IEMOCAP datasets.β42Oct 6, 2020Updated 5 years ago
- Diffusion Model for Voice Conversionβ69Mar 14, 2024Updated last year
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conteβ¦β43Mar 3, 2025Updated 11 months ago
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representationsβ40Dec 18, 2023Updated 2 years ago
- This repository contains the code for the paper "Self-supervised Text Style Transfer using Cycle-Consistent Adversarial Networks".β10Dec 2, 2024Updated last year
- In this work is proposed a speech emotion recognition model based on the extraction of four different features got from RAVDESS sound filβ¦β10Feb 27, 2022Updated 3 years ago
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection frβ¦β11Dec 19, 2025Updated last month
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment.β25Nov 25, 2025Updated 2 months ago
- [ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Eβ¦β187May 15, 2024Updated last year
- official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verificationβ14Mar 14, 2025Updated 11 months ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancementβ14Sep 6, 2024Updated last year
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)β12Sep 2, 2024Updated last year
- An open-source Kazakh Emotional Text-to-Speech Datasetβ35Aug 1, 2025Updated 6 months ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioningβ16Jun 23, 2024Updated last year
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognitionβ153Oct 26, 2021Updated 4 years ago
- β12Dec 29, 2023Updated 2 years ago
- Trustworthy Speech Emotion Recognitionβ13May 22, 2023Updated 2 years ago
- ITALIC: An ITALian Intent Classification Datasetβ14Nov 24, 2023Updated 2 years ago