nhattruongpham / mmser
SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings
☆15Updated 2 years ago
Alternatives and similar repositories for mmser
Users who are interested in mmser are comparing it to the repositories listed below
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)☆33Updated 2 years ago
- ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"☆14Updated last year
- This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…☆50Updated 3 years ago
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…☆24Updated 2 years ago
- SpeechFormer++ in PyTorch☆50Updated 2 years ago
- Transformer-based model for Speech Emotion Recognition (SER), implemented in PyTorch☆42Updated last year
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition (ICASSP 2023 Oral)☆69Updated last year
- Detect emotion from audio signals of IEMOCAP dataset using multi-modal approach. Utilized acoustic features, mel-spectrogram and text as …☆41Updated last year
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆40Updated 2 years ago
- ☆49Updated 2 years ago
- Scripts used in the research described in the paper "Multimodal Emotion Recognition with High-level Speech and Text Features" accepted in…☆52Updated 4 years ago
- A survey of deep multimodal emotion recognition.☆54Updated 3 years ago
- [InterSpeech'2023] "Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion"☆13Updated last year
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33Updated 3 years ago
- Repository for code and paper submitted for APSIPA 2019, Lanzhou, China☆21Updated last year
- Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments,☆55Updated 3 months ago
- Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"☆28Updated 2 years ago
- The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion R…☆118Updated 4 years ago
- Deepfake cross-lingual evaluation dataset (DECRO) is constructed to evaluate the influence of language differences on deepfake detection.…☆15Updated 2 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆14Updated last year
- Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association"☆20Updated last month
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆15Updated 3 years ago
- The official code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"☆13Updated 2 years ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆153Updated 4 years ago
- PyTorch implementation for Audio-Visual Domain Adaptation Feature Fusion for Speech Emotion Recognition☆12Updated 3 years ago
- Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition☆82Updated 3 years ago
- The code for our IEEE ACCESS (2020) paper Multimodal Emotion Recognition with Transformer-Based Self Supervised Feature Fusion.☆123Updated 4 years ago
- This repository contains PyTorch implementations of 4 different models for classifying emotions in speech.☆211Updated 3 years ago
- (NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class I…☆19Updated last year
- Chinese BERT classification with TF 2.0 and audio classification with MFCC☆14Updated 5 years ago