vince9515 / SERLinks

语音情感识别代码，结合1D-CNN与GRU在语音增强的CASIA数据集实现语音情感识别，并利用注意力机制进行模型优化

☆16

Alternatives and similar repositories for SER

Users that are interested in SER are comparing it to the libraries listed below

Sorting:

yingdajun / SpeechEmotionAndPeopleAnalyse
用CASIA database数据集做的，做的语音情感识别和语音识人的练习
☆68Updated 2 years ago
yeyupiaoling / SpeechEmotionRecognition-Pytorch
基于Pytorch实现的语音情感识别
☆209Updated 2 months ago
lixiangucas01 / GLAM
This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…
☆47Updated 3 years ago
Kevinnan-teen / Speaker-Recognition
说话人识别（声纹识别）算法的Python实现。包括GMM（已完成）、GMM-UBM、ivector、基于深度学习的声纹识别（self-attention已完成）。
☆99Updated 2 years ago
Zhaofan-Su / SpeechEmotionRecognition-papers-codes
☆15Updated 6 years ago
Vincent-ZHQ / CA-MSER
Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information
☆146Updated last year
alaaNfissi / SigWavNet-Learning-Multiresolution-Signal-Wavelet-Network-for-Speech-Emotion-Recognition
This paper has been accepted for publication in IEEE Transactions on Affective Computing.
☆14Updated 4 months ago
azarmehri / lung-sound-vggish
Implementation of IEEE Access paper - Lung Sound Recognition Algorithm Based on VGGish-BiGRU
☆29Updated 5 years ago
vandana-rajan / 1D-Speech-Emotion-Recognition
Speech Emotion Recognition from raw speech signals using 1D CNN-LSTM
☆106Updated 4 years ago
AryaAftab / LIGHT-SERNET
Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition
☆74Updated 3 years ago
shock1ng / Mutil-Modal-wav2vec2.0-BERT
多模态，语音和文本结合的情感识别，大模型finetune
☆21Updated last year
zlzhang1124 / AcousticFeatureExtraction
Acoustic feature extraction using Librosa library and openSMILE toolkit.使用Librosa音频处理库和openSMILE工具包，进行简单的声学特征提取
☆201Updated 5 years ago
PandoraLS / SpeechEnhancement
语音增强
☆17Updated 4 years ago
PiotrSobczak / speech-emotion-recognition
Multi-modal Speech Emotion Recogniton on IEMOCAP dataset
☆89Updated 2 years ago
yeyupiaoling / SpeechEmotionRecognition-PaddlePaddle
语音感情识别
☆36Updated 2 months ago
Jiaxin-Ye / TIM-Net_SER
[ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…
☆175Updated last year
hellolzc / SpeechEmotionRecognition-emodb
Speech Emotion Recognition
☆28Updated 5 years ago
scutcsq / DWFormer
DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)
☆60Updated last year
kadoufall / Urban-Sound-Classification-VS
城市声音分类 Urban Sound Classification with TensorFlow Keras - MLP, RNN, CNN
☆95Updated 6 years ago
IliaZenkov / transformer-cnn-emotion-recognition
Speech Emotion Classification with novel Parallel CNN-Transformer model built with PyTorch, plus thorough explanations of CNNs, Transform…
☆258Updated 4 years ago
SCNU-RISLAB / CNN-Transformer-and-Multidimensional-Attention-Mechanism
☆14Updated last year
JabuMlDev / Speaker-VGG-CCT
Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…
☆22Updated 2 years ago
hotelll / speech-emotion-recognition
这个项目将 RAVDESS 数据集切割成 1s 短语音，利用 openSMILE+CNN 进行训练，目标是将短语音分类到四种情感中，分别是：开心（happy）、悲伤（sad）、生气（angry）和中性（neutral）。最后准确率达到 76% 左右。
☆57Updated 4 years ago
HappyColor / SpeechFormer2
SpeechFormer++ in PyTorch
☆48Updated last year
glam-imperial / semantic_speech_emotion_recognition
This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…
☆24Updated 4 years ago
rcantini / speech_emotion_recognition
How to detect emotions from speech using Bi-directional LSTM networks and attention mechanism in Keras.
☆20Updated last year
bagustris / deep-mlp-ser
Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition
☆11Updated last year
ericguizzo / multi_time_scale
Implementation of the multi-time-scale convolution layer used in the paper Multi-Time-Scale Convolution for Emotion Recognition from Spee…
☆11Updated 5 years ago
ECNU-Cross-Innovation-Lab / ShiftSER
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
☆37Updated last year
Hbbbbbby / EmotionRecognition_2Dcnn-lstm
The code ruproduced the emotion recognition model, 2D CNN LSTM networks, which based on <Speechemotionrecognitionusingdeep1D&2DCNNLSTMnet…
☆23Updated 4 years ago