vince9515 / SER
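Speech emotion recognition code: combines a 1D-CNN with a GRU to perform speech emotion recognition on the speech-enhanced CASIA dataset, and uses an attention mechanism to optimize the model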
☆17Updated 4 years ago
Alternatives and similar repositories for SER
Users that are interested in SER are comparing it to the libraries listed below
- ☆17Updated 6 years ago
- Speech emotion recognition implemented in PyTorch☆258Updated last month
- Practice exercises in speech emotion recognition and speaker identification, built on the CASIA database☆74Updated 3 years ago
- Simple acoustic feature extraction using the Librosa audio processing library and the openSMILE toolkit.☆216Updated 5 years ago
- This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…☆50Updated 3 years ago
- Speech Emotion Recognition from raw speech signals using 1D CNN-LSTM☆109Updated 4 years ago
- Speech Emotion Recognition☆28Updated 5 years ago
- Multi-modal Speech Emotion Recognition on IEMOCAP dataset☆95Updated 2 years ago
- [ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…☆187Updated last year
- Speech Emotion Classification with novel Parallel CNN-Transformer model built with PyTorch, plus thorough explanations of CNNs, Transform…☆264Updated 5 years ago
- Speech emotion recognition☆44Updated last month
- Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition☆83Updated 3 years ago
- HMM-GMM acoustic model in Python☆29Updated 7 years ago
- Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information☆164Updated 2 years ago
- Multimodal emotion recognition combining speech and text, with large-model fine-tuning☆23Updated 2 years ago
- Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition☆11Updated 2 years ago
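- This project splits the RAVDESS dataset into 1 s short utterances and trains on them with openSMILE + CNN. The goal is to classify each short utterance into one of four emotions: happy, sad, angry, and neutral. The final accuracy is about 76%.☆64Updated 4 years ago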
- alaaNfissi / SigWavNet-Learning-Multiresolution-Signal-Wavelet-Network-for-Speech-Emotion-Recognition: This paper has been accepted for publication in IEEE Transactions on Affective Computing.☆19Updated 11 months ago
- Implementation of IEEE Access paper - Lung Sound Recognition Algorithm Based on VGGish-BiGRU☆30Updated 6 years ago
- ☆27Updated 6 months ago
- ☆10Updated last year
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition (ICASSP 2023 Oral)☆69Updated last year
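- Python implementations of speaker recognition (voiceprint recognition) algorithms, including GMM (done), GMM-UBM, i-vector, and deep-learning-based speaker recognition (self-attention done).☆106Updated 2 years ago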
- A PyTorch implementation of Speech emotion recognition using deep 1D & 2D CNN LSTM networks☆27Updated 2 years ago
- The code reproduces the emotion recognition model, 2D CNN LSTM networks, which is based on <Speech emotion recognition using deep 1D & 2D CNN LSTM net…☆26Updated 4 years ago
- Speech enhancement☆18Updated 4 years ago
- A spectro-temporal fusion feature, STgram, with MobileFaceNet for more stable Anomalous Sound Detection☆100Updated 2 years ago
- SpeechFormer++ in PyTorch☆50Updated 2 years ago
- FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION☆23Updated last year
- Deformable Speech Transformer (DST)☆35Updated last year