Multimodal SER Model meant to be trained on recognising emotions from speech (text + acoustic data). Fine-tuned the DeBERTaV3 model, respectively the Wav2Vec2 model to extract the features and classify the emotions from the text, respectively audio data, then passed their features and their classification through an MLP to achieve better results…
☆11Jun 19, 2024Updated last year
Alternatives and similar repositories for multimodal-speech-emotion-recognition
Users that are interested in multimodal-speech-emotion-recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Fully End2End Multimodal System for Fast Yet Effective Video Emotion Recognition☆39Aug 12, 2024Updated last year
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated 2 years ago
- FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION☆23Dec 22, 2024Updated last year
- "MULTIMODAL EMOTION RECOGNITION BASED ON DEEP TEMPORAL FEATURES USING CROSS-MODAL TRANSFORMER AND SELF-ATTENTION" ICASSP'23☆23Feb 26, 2023Updated 3 years ago
- ☆19Oct 13, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A multimodal SER project combining BERT and ECAPA-TDNN with cross-attention-based fusion on the IEMOCAP dataset.☆10Dec 9, 2024Updated last year
- Trustworthy Speech Emotion Recognition☆13May 22, 2023Updated 2 years ago
- Multimodal Emotion Recognition in a video using feature level fusion of audio and visual modalities☆15Jul 5, 2018Updated 7 years ago
- Code for IJCB 2023 paper - GaitRef: Gait Recognition with Refined Sequential Skeletons☆18Apr 22, 2023Updated 2 years ago
- MultiEMO: An Attention-Based Correlation-Aware Multimodal Fusion Framework for Emotion Recognition in Conversations (ACL 2023)☆93Nov 17, 2023Updated 2 years ago
- The repository for a coming gait recognition work.☆19Jun 8, 2023Updated 2 years ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆21Aug 9, 2023Updated 2 years ago
- IEEE T-BIOM : "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention"☆46Nov 29, 2024Updated last year
- Curated list of publically available railway related datasets captured with point cloud data.☆22Oct 21, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Coherent Deconfounding Autoencoder (CODE-AE) can extract both common biological signals shared by incoherent samples and private represen…☆21Oct 1, 2024Updated last year
- Source code for the Gait Recognition using LSTM, presented in the paper "Multi-model Long Short-term Memory Network for Gait Recognition …☆19Apr 13, 2022Updated 3 years ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆25Nov 9, 2023Updated 2 years ago
- Repository for my paper: Evaluation of Error and Correlation-Based Loss Functions For Multitask Learning Dimensional Speech Emotion Recog…☆20Mar 13, 2024Updated 2 years ago
- Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning☆17Aug 2, 2024Updated last year
- LandmarkGait: Intrinsic Human Parsing for Gait Recognition (ACM MM 2023)☆18Jun 13, 2024Updated last year
- Pytorch implementation for the paper: Multivariate, Multi-frequency and Multimodal: Rethinking Graph Neural Networks for Emotion Recognit…☆57Dec 5, 2023Updated 2 years ago
- 该仓库主要描述了CCAC2023多模态对话情绪识别评测第3名的实现过程☆12Aug 11, 2024Updated last year
- MultiModal Sentiment Analysis (Text and Audio) (Pytorch)☆23Jul 27, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- QAGait: Revisit Gait Recognition From a Quality Perspective (AAAI 2024)☆23Aug 26, 2024Updated last year
- Multimodal (text, acoustic, visual) Sentiment Analysis and Emotion Recognition on CMU-MOSEI dataset.☆29Nov 8, 2020Updated 5 years ago
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆69Jul 8, 2024Updated last year
- Multimodal Emotion eXpression Capture Amsterdam. Pipeline for capturing emotion expressions from multiple modalities (video, audio, text)…☆36Apr 7, 2025Updated last year
- ☆11Mar 23, 2026Updated 2 weeks ago
- ☆11Nov 11, 2022Updated 3 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- Multimodal emotion recognition system of attention based vision network + audio network☆14Jul 21, 2020Updated 5 years ago
- 多模态,语音和文本结合的情感识别,大模型finetune☆25Nov 19, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- Extract Unique Word Lists From Wikipedia Database☆13May 27, 2020Updated 5 years ago
- Multimodal Sentiment Analysis using unified transformer☆25Oct 11, 2022Updated 3 years ago
- Toate resursele de invatat pentru cei de la informatica romana, de la Babes-Bolyai.☆37Feb 6, 2017Updated 9 years ago
- Code for paper "Cross-Domain Slot Filling as Machine Reading Comprehension" in IJCAI 2021☆11Aug 24, 2021Updated 4 years ago
- NuNER is the family of SOTA Foundation and Zero-shot for Entity Recognition☆15Jun 11, 2024Updated last year
- This repository provides implementation for the paper "Self-attention fusion for audiovisual emotion recognition with incomplete data".☆159Sep 16, 2024Updated last year