Transformer-based model for Speech Emotion Recognition(SER) - implemented by Pytorch
☆42Apr 12, 2024Updated last year
Alternatives and similar repositories for Transformer-based-SER
Users that are interested in Transformer-based-SER are comparing it to the libraries listed below
Sorting:
- Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MF…☆12Apr 28, 2025Updated 10 months ago
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)☆33Sep 29, 2023Updated 2 years ago
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…☆24Feb 17, 2023Updated 3 years ago
- FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION☆23Dec 22, 2024Updated last year
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆40Dec 18, 2023Updated 2 years ago
- SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings☆15Jan 23, 2024Updated 2 years ago
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition☆27Apr 11, 2024Updated last year
- MFCC features + SVM for speech emotion classification☆16Oct 21, 2020Updated 5 years ago
- Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information☆164Nov 27, 2023Updated 2 years ago
- SpeechFormer++ in PyTorch☆50Jul 21, 2023Updated 2 years ago
- ☆27Jul 17, 2025Updated 7 months ago
- [ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…☆60Jul 1, 2024Updated last year
- ☆10Jul 16, 2024Updated last year
- ATTENTION AGGREGATION NETWORK FOR AUDIO-VISUAL EMOTION RECOGNITION☆13Sep 25, 2023Updated 2 years ago
- In this work is proposed a speech emotion recognition model based on the extraction of four different features got from RAVDESS sound fil…☆10Feb 27, 2022Updated 4 years ago
- Automatic speech emotion recognition based on transfer learning from spectrograms using ResNET☆27Mar 11, 2022Updated 3 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33May 18, 2022Updated 3 years ago
- 😎 Awesome lists about Speech Emotion Recognition☆101Dec 24, 2024Updated last year
- ☆112Aug 10, 2022Updated 3 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆400Sep 30, 2024Updated last year
- ☆22Sep 10, 2024Updated last year
- This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…☆50Apr 11, 2022Updated 3 years ago
- Speech Emotion Classification with novel Parallel CNN-Transformer model built with PyTorch, plus thorough explanations of CNNs, Transform…☆264Nov 6, 2020Updated 5 years ago
- Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching☆51Jun 11, 2018Updated 7 years ago
- Jupyter Notebook Praktikum Projects. This is repository with data analyst educational projects from Yandex.Praktikum.☆11Feb 21, 2021Updated 5 years ago
- Multi-modal Human Emotion Recognition of speech clips (audio + video) contained in RAVDESS dataset using a two stream architecture☆32Mar 2, 2023Updated 2 years ago
- PySYCL is an open-source Python interface for SYCL.☆15Apr 18, 2025Updated 10 months ago
- XCORE-VOICE Solution☆17Jun 12, 2025Updated 8 months ago
- CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer☆35Feb 4, 2025Updated last year
- This repository provides the codes for MMA-DFER: multimodal (audiovisual) emotion recognition method. This is an official implementation …☆50Sep 16, 2024Updated last year
- An accurate scrapper to scrape popular persian websites, mostly intended to be used as a tool to create large corpora for Persian languag…☆37Jan 20, 2025Updated last year
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- ☆11Apr 4, 2021Updated 4 years ago
- Module list for Sectors's financial analytics workshop☆11Jun 20, 2024Updated last year
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Oct 30, 2024Updated last year
- Pytorch implementation for the paper: Adversarial alignment and graph fusion via information bottleneck for multimodal emotion recognitio…☆15Sep 19, 2024Updated last year
- A semi print-in-place hand for human-like manipulation, designed to be built by anyone.☆17Jan 5, 2026Updated last month
- data compression library for embedded/real-time systems☆13Dec 8, 2015Updated 10 years ago
- [IROS 2025] EgoLoc: Zero-Shot Temporal Interaction Localization for Egocentric Videos☆32Jan 13, 2026Updated last month