😎 Awesome lists about Speech Emotion Recognition
☆101Dec 24, 2024Updated last year
Alternatives and similar repositories for awesome-speech-emotion-recognition
Users that are interested in awesome-speech-emotion-recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.☆19Dec 20, 2023Updated 2 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆414Sep 30, 2024Updated last year
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)☆32Sep 29, 2023Updated 2 years ago
- Speaker overlap-aware Neural Diarization☆12Feb 13, 2023Updated 3 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Feb 17, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition☆83Mar 12, 2024Updated 2 years ago
- [KDD 2026] Voxlect: A Speech Foundation Model Benchmark for Modeling Dialects and Regional Languages Around the Globe☆32Aug 10, 2025Updated 9 months ago
- ☆13Nov 25, 2023Updated 2 years ago
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey".☆240May 19, 2026Updated last week
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆70Jul 8, 2024Updated last year
- ☆70Sep 13, 2024Updated last year
- Emotion recognition from IEMOCAP datasets.☆45Oct 6, 2020Updated 5 years ago
- Transformer-based model for Speech Emotion Recognition(SER) - implemented by Pytorch☆42Apr 12, 2024Updated 2 years ago
- [ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training fo…☆1,131Dec 23, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification☆14Mar 14, 2025Updated last year
- Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better u…☆27Apr 19, 2024Updated 2 years ago
- Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.☆17Aug 8, 2021Updated 4 years ago
- This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.☆56Apr 14, 2025Updated last year
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆152Oct 26, 2021Updated 4 years ago
- [INTERSPEECH 2024] The official implementation of EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for …☆178May 20, 2025Updated last year
- [INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark☆319Mar 18, 2026Updated 2 months ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- Deformable Speech Transformer (DST)☆35Aug 8, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition☆28Apr 11, 2024Updated 2 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆15Jun 11, 2024Updated last year
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆139Jan 6, 2025Updated last year
- ☆14Oct 12, 2024Updated last year
- [ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…☆59Jul 1, 2024Updated last year
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆689Dec 25, 2024Updated last year
- Extract frequency, power, width and dissonance of formants from wav files☆28Jun 3, 2022Updated 3 years ago
- [ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…☆191May 15, 2024Updated 2 years ago
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆40Dec 18, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 로봇의 감정 및 개성을 표현할 수 있는 대화형 음성합성 오픈소스 플랫폼☆108Feb 5, 2025Updated last year
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…☆12Dec 19, 2025Updated 5 months ago
- Sequence alignement methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- Trustworthy Speech Emotion Recognition☆13May 22, 2023Updated 3 years ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings☆15Jan 23, 2024Updated 2 years ago
- A list of podcast URLs scraped from the Apple podcast database in late 2021, including a script for downloading those podcasts.☆44Mar 9, 2022Updated 4 years ago