Transformer-based model for Speech Emotion Recognition(SER) - implemented by Pytorch
☆42Apr 12, 2024Updated last year
Alternatives and similar repositories for Transformer-based-SER
Users that are interested in Transformer-based-SER are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MF…☆12Apr 28, 2025Updated 10 months ago
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)☆33Sep 29, 2023Updated 2 years ago
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…☆25Feb 17, 2023Updated 3 years ago
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆40Dec 18, 2023Updated 2 years ago
- SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings☆15Jan 23, 2024Updated 2 years ago
- Trustworthy Speech Emotion Recognition☆13May 22, 2023Updated 2 years ago
- FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION☆23Dec 22, 2024Updated last year
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition☆27Apr 11, 2024Updated last year
- Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information☆164Nov 27, 2023Updated 2 years ago
- SpeechFormer++ in PyTorch☆50Jul 21, 2023Updated 2 years ago
- [ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…☆60Jul 1, 2024Updated last year
- In this work is proposed a speech emotion recognition model based on the extraction of four different features got from RAVDESS sound fil…☆10Feb 27, 2022Updated 4 years ago
- [ICASSP 2025] "Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attentio…☆34Apr 27, 2025Updated 10 months ago
- ATTENTION AGGREGATION NETWORK FOR AUDIO-VISUAL EMOTION RECOGNITION☆13Sep 25, 2023Updated 2 years ago
- ☆14Sep 2, 2023Updated 2 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33May 18, 2022Updated 3 years ago
- ☆28Jul 17, 2025Updated 8 months ago
- This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…☆50Apr 11, 2022Updated 3 years ago
- Speech Emotion Classification with novel Parallel CNN-Transformer model built with PyTorch, plus thorough explanations of CNNs, Transform…☆266Nov 6, 2020Updated 5 years ago
- Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching☆51Jun 11, 2018Updated 7 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆407Sep 30, 2024Updated last year
- 😎 Awesome lists about Speech Emotion Recognition☆101Dec 24, 2024Updated last year
- ☆112Aug 10, 2022Updated 3 years ago
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated 2 years ago
- ☆22Sep 10, 2024Updated last year
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 3 years ago
- I created some notebooks about different concepts of financial engineering☆10Sep 28, 2025Updated 5 months ago
- Module list for Sectors's financial analytics workshop☆11Jun 20, 2024Updated last year
- SwinTransformer for Tensorflow2☆11Jul 7, 2022Updated 3 years ago
- Multi-modal Human Emotion Recognition of speech clips (audio + video) contained in RAVDESS dataset using a two stream architecture☆32Mar 2, 2023Updated 3 years ago
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Dec 23, 2023Updated 2 years ago
- Multimodal SER Model meant to be trained on recognising emotions from speech (text + acoustic data). Fine-tuned the DeBERTaV3 model, resp…☆11Jun 19, 2024Updated last year
- CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer☆35Feb 4, 2025Updated last year
- RoBERTa based Spam Message Detection☆18Aug 24, 2025Updated 6 months ago
- Official implement of SpeechFormer written in Python (PyTorch).☆78Apr 1, 2023Updated 2 years ago
- A Multimodal Discord bot with machine learning functions, including LLM chat, Image generation, and Speech Generation capabilities☆12Jan 7, 2024Updated 2 years ago
- ☆12Aug 29, 2019Updated 6 years ago
- Persian Grapheme-to-Phoneme (G2P) converter☆21Dec 15, 2020Updated 5 years ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆152Oct 26, 2021Updated 4 years ago