Multimodal SER model trained to recognise emotions from speech (text + acoustic data). DeBERTaV3 is fine-tuned on the text and Wav2Vec2 on the audio to extract features and classify emotions; both models' features and classifications are then passed through an MLP to achieve better results…
☆11Jun 19, 2024Updated last year
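The fusion step described above (features and classification outputs from both modalities fed to an MLP) can be sketched roughly as follows. This is a minimal illustration in plain Python, not the repository's code: the class name `FusionMLP`, the dimensions, the single hidden layer with ReLU, and the random untrained weights are all assumptions, and the input vectors are placeholders standing in for what the fine-tuned DeBERTaV3 and Wav2Vec2 models would produce.

```python
import math
import random


def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(value - peak) for value in logits]
    total = sum(exps)
    return [value / total for value in exps]


def linear(inputs, weights, bias):
    """Dense layer: one weight row and one bias term per output unit."""
    return [
        sum(w * x for w, x in zip(row, inputs)) + b
        for row, b in zip(weights, bias)
    ]


def relu(values):
    return [max(0.0, value) for value in values]


def init_layer(n_in, n_out, rng):
    """Small random weights and zero biases (hypothetical initialisation)."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


class FusionMLP:
    """Hypothetical late-fusion head over text and audio model outputs."""

    def __init__(self, text_dim, audio_dim, num_classes, hidden_dim=16, seed=0):
        rng = random.Random(seed)
        # Input = both feature vectors plus both per-class score vectors.
        self.in_dim = text_dim + audio_dim + 2 * num_classes
        self.w1, self.b1 = init_layer(self.in_dim, hidden_dim, rng)
        self.w2, self.b2 = init_layer(hidden_dim, num_classes, rng)

    def forward(self, text_feat, text_logits, audio_feat, audio_logits):
        fused = (list(text_feat) + list(text_logits)
                 + list(audio_feat) + list(audio_logits))
        if len(fused) != self.in_dim:
            raise ValueError(f"expected {self.in_dim} inputs, got {len(fused)}")
        hidden = relu(linear(fused, self.w1, self.b1))
        return softmax(linear(hidden, self.w2, self.b2))


# Placeholder inputs; real values would come from the two fine-tuned encoders.
model = FusionMLP(text_dim=4, audio_dim=3, num_classes=4)
probs = model.forward(
    text_feat=[0.2, -0.1, 0.5, 0.0],
    text_logits=[1.2, 0.1, -0.3, 0.4],
    audio_feat=[0.7, -0.4, 0.1],
    audio_logits=[0.3, 0.9, -0.2, 0.0],
)
print(probs)  # one probability per emotion class
```

Concatenating the per-class scores alongside the features lets the fusion layer see each modality's own prediction as well as its representation; in practice the head would be trained on labelled data rather than used with random weights as here.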
Alternatives and similar repositories for multimodal-speech-emotion-recognition
Users that are interested in multimodal-speech-emotion-recognition are comparing it to the libraries listed below.
- A Fully End2End Multimodal System for Fast Yet Effective Video Emotion Recognition☆39Aug 12, 2024Updated last year
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated 2 years ago
- FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION☆23Dec 22, 2024Updated last year
- "MULTIMODAL EMOTION RECOGNITION BASED ON DEEP TEMPORAL FEATURES USING CROSS-MODAL TRANSFORMER AND SELF-ATTENTION" ICASSP'23☆23Feb 26, 2023Updated 3 years ago
- ☆19Oct 13, 2025Updated 6 months ago
- A multimodal SER project combining BERT and ECAPA-TDNN with cross-attention-based fusion on the IEMOCAP dataset.☆10Dec 9, 2024Updated last year
- Trustworthy Speech Emotion Recognition☆13May 22, 2023Updated 2 years ago
- Multimodal Emotion Recognition in a video using feature-level fusion of audio and visual modalities☆15Jul 5, 2018Updated 7 years ago
- Code for IJCB 2023 paper - GaitRef: Gait Recognition with Refined Sequential Skeletons☆18Apr 22, 2023Updated 3 years ago
- MultiEMO: An Attention-Based Correlation-Aware Multimodal Fusion Framework for Emotion Recognition in Conversations (ACL 2023)☆94Nov 17, 2023Updated 2 years ago
- The repository for a coming gait recognition work.☆19Jun 8, 2023Updated 2 years ago
- IEEE T-BIOM: "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention"☆47Nov 29, 2024Updated last year
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆22Aug 9, 2023Updated 2 years ago
- Curated list of publicly available railway-related datasets captured with point cloud data.☆22Oct 21, 2024Updated last year
- Coherent Deconfounding Autoencoder (CODE-AE) can extract both common biological signals shared by incoherent samples and private represen…☆21Oct 1, 2024Updated last year
- Source code for the Gait Recognition using LSTM, presented in the paper "Multi-model Long Short-term Memory Network for Gait Recognition …☆19Apr 13, 2022Updated 4 years ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆25Nov 9, 2023Updated 2 years ago
- Repository for my paper: Evaluation of Error and Correlation-Based Loss Functions For Multitask Learning Dimensional Speech Emotion Recog…☆20Mar 13, 2024Updated 2 years ago
- Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning☆17Aug 2, 2024Updated last year
- [ACM MM 2023] LandmarkGait: Intrinsic Human Parsing for Gait Recognition☆18Jun 13, 2024Updated last year
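- This repository mainly describes the implementation of the 3rd-place entry in the CCAC2023 multimodal conversational emotion recognition evaluation☆12Aug 11, 2024Updated last year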
- Pytorch implementation for the paper: Multivariate, Multi-frequency and Multimodal: Rethinking Graph Neural Networks for Emotion Recognit…☆58Dec 5, 2023Updated 2 years ago
- MultiModal Sentiment Analysis (Text and Audio) (PyTorch)☆23Jul 27, 2022Updated 3 years ago
- [AAAI 2024] QAGait: Revisit Gait Recognition From a Quality Perspective☆23Aug 26, 2024Updated last year
- Multimodal (text, acoustic, visual) Sentiment Analysis and Emotion Recognition on CMU-MOSEI dataset.☆30Nov 8, 2020Updated 5 years ago
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition (ICASSP 2023 Oral)☆70Jul 8, 2024Updated last year
- Multimodal Emotion eXpression Capture Amsterdam. Pipeline for capturing emotion expressions from multiple modalities (video, audio, text)…☆37Apr 7, 2025Updated last year
- ☆12Mar 23, 2026Updated last month
- ☆11Nov 11, 2022Updated 3 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- Multimodal emotion recognition system combining an attention-based vision network with an audio network☆14Jul 21, 2020Updated 5 years ago
- Multimodal emotion recognition combining speech and text, with large-model fine-tuning☆25Nov 19, 2023Updated 2 years ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- Extract Unique Word Lists From Wikipedia Database☆13May 27, 2020Updated 5 years ago
- Multimodal Sentiment Analysis using unified transformer☆25Oct 11, 2022Updated 3 years ago
- All the learning resources for students in the Romanian-language computer science programme at Babes-Bolyai.☆37Feb 6, 2017Updated 9 years ago
- Code for paper "Cross-Domain Slot Filling as Machine Reading Comprehension" in IJCAI 2021☆11Aug 24, 2021Updated 4 years ago
- NuNER is a family of SOTA foundation and zero-shot models for entity recognition☆15Jun 11, 2024Updated last year
- This repository provides implementation for the paper "Self-attention fusion for audiovisual emotion recognition with incomplete data".☆162Sep 16, 2024Updated last year