HoseinAzad/Transformer-based-SER

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HoseinAzad/Transformer-based-SER)

HoseinAzad / Transformer-based-SER

Transformer-based model for Speech Emotion Recognition(SER) - implemented by Pytorch

☆42

Alternatives and similar repositories for Transformer-based-SER

Users that are interested in Transformer-based-SER are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NeuroByte-Consulting / Speech-Emotion-Recognition-in-Tensorflow-Using-CNNs
View on GitHub
Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MF…
☆12Apr 28, 2025Updated last year
hwang9u / emocatcher
View on GitHub
[RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)
☆31Sep 29, 2023Updated 2 years ago
JabuMlDev / Speaker-VGG-CCT
View on GitHub
Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…
☆25Feb 17, 2023Updated 3 years ago
nhattruongpham / mmser
View on GitHub
SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings
☆15Jan 23, 2024Updated 2 years ago
Jason-Oleana / speech-emotion-classification
View on GitHub
MFCC features + SVM for speech emotion classification
☆16Oct 21, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
usc-sail / trust-ser
View on GitHub
Trustworthy Speech Emotion Recognition
☆13May 22, 2023Updated 3 years ago
ASolitaryMan / HFLEA
View on GitHub
FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION
☆23Dec 22, 2024Updated last year
bubaimaji / cmt-mser
View on GitHub
"MULTIMODAL EMOTION RECOGNITION BASED ON DEEP TEMPORAL FEATURES USING CROSS-MODAL TRANSFORMER AND SELF-ATTENTION" ICASSP'23
☆24Feb 26, 2023Updated 3 years ago
ECNU-Cross-Innovation-Lab / ENT
View on GitHub
[ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition
☆28Apr 11, 2024Updated 2 years ago
Vincent-ZHQ / CA-MSER
View on GitHub
Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information
☆163Nov 27, 2023Updated 2 years ago
HappyColor / SpeechFormer2
View on GitHub
SpeechFormer++ in PyTorch
☆50Jul 21, 2023Updated 3 years ago
flaviorainhoavila / IEMOCAPspeechEmotionRecognition
View on GitHub
Automatic speech emotion recognition based on transfer learning from spectrograms using ResNET
☆27Mar 11, 2022Updated 4 years ago
AndreaLombax / Speech_emotion_recognition
View on GitHub
In this work is proposed a speech emotion recognition model based on the extraction of four different features got from RAVDESS sound fil…
☆10Feb 27, 2022Updated 4 years ago
vaibhavsundharam / Speech-Emotion-Analysis
View on GitHub
Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…
☆25Jun 23, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
usc-sail / peft-ser
View on GitHub
[ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…
☆60Jul 1, 2024Updated 2 years ago
MengboLi / MS-SENet
View on GitHub
☆11Jul 16, 2024Updated 2 years ago
zyh9929 / RL-EMO
View on GitHub
☆15Sep 2, 2023Updated 2 years ago
IliaZenkov / transformer-cnn-emotion-recognition
View on GitHub
Speech Emotion Classification with novel Parallel CNN-Transformer model built with PyTorch, plus thorough explanations of CNNs, Transform…
☆269Nov 6, 2020Updated 5 years ago
royal-dargon / MDSE
View on GitHub
☆24Sep 10, 2024Updated last year
SuperKogito / SER-datasets
View on GitHub
A collection of datasets for the purpose of emotion recognition/detection in speech.
☆420Sep 30, 2024Updated last year
abikaki / awesome-speech-emotion-recognition
View on GitHub
😎 Awesome lists about Speech Emotion Recognition
☆101Dec 24, 2024Updated last year
liyunlongaaa / AD-TUNING
View on GitHub
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11Feb 23, 2024Updated 2 years ago
WarmCongee / SDUMC
View on GitHub
[ICASSP 2025] "Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attentio…
☆38Apr 27, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
SCNU-RISLAB / CNN-Transformer-and-Multidimensional-Attention-Mechanism
View on GitHub
☆34Jul 17, 2025Updated last year
gianscuri / Emotion-Recognition_SER-FER_RAVDESS
View on GitHub
Multi-modal Human Emotion Recognition of speech clips (audio + video) contained in RAVDESS dataset using a two stream architecture
☆32Mar 2, 2023Updated 3 years ago
bagustris / ssl-ser
View on GitHub
Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"
☆10Mar 15, 2023Updated 3 years ago
LorenzoGianassi / Land-Diffuser
View on GitHub
The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…
☆13Dec 23, 2023Updated 2 years ago
CZ26 / CycleTransGAN-EVC
View on GitHub
CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer
☆35Feb 4, 2025Updated last year
Meatfucker / metatron2
View on GitHub
A Multimodal Discord bot with machine learning functions, including LLM chat, Image generation, and Speech Generation capabilities
☆12Jan 7, 2024Updated 2 years ago
SkyOL5 / VQA-CoAttention
View on GitHub
☆12Aug 29, 2019Updated 6 years ago
fushengwuyu / R-Drop
View on GitHub
RDrop 的 torch版
☆16Jul 15, 2021Updated 5 years ago
Renovamen / Speech-Emotion-Recognition
View on GitHub
Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别
☆1,310Mar 25, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
b04901014 / FT-w2v2-ser
View on GitHub
Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
☆152Oct 26, 2021Updated 4 years ago
silentbicycle / heatshrink
View on GitHub
data compression library for embedded/real-time systems
☆13Dec 8, 2015Updated 10 years ago
pgvector / pgvector-dart
View on GitHub
pgvector support for Dart
☆15Jul 9, 2026Updated 2 weeks ago
cMadan / MRIdataviz
View on GitHub
Supplementary to "Data visualization for inference in tomographic brain imaging"
☆10Sep 30, 2019Updated 6 years ago
zerohd4869 / MM-DFN
View on GitHub
Source code for ICASSP 2022 paper "MM-DFN: Multimodal Dynamic Fusion Network For Emotion Recognition in Conversations".
☆94Apr 21, 2023Updated 3 years ago
rogeroyer / face_recognition
View on GitHub
一个带有界面的人脸识别小项目
☆11Jan 22, 2019Updated 7 years ago
iiscleap / ZEST
View on GitHub
Zero-Shot Emotion Style Transfer
☆49Apr 23, 2025Updated last year