ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"
☆14Nov 29, 2024Updated last year
Alternatives and similar repositories for RecurrentJointAttentionwithLSTMs
Users that are interested in RecurrentJointAttentionwithLSTMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code for Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations☆11Jan 17, 2023Updated 3 years ago
- IEEE T-BIOM : "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention"☆47Nov 29, 2024Updated last year
- FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition☆34Nov 29, 2024Updated last year
- ABAW6 (CVPR-W) We achieved second place in the valence arousal challenge of ABAW6☆32May 21, 2024Updated 2 years ago
- Source code for ICASSP 2022 paper "MM-DFN: Multimodal Dynamic Fusion Network For Emotion Recognition in Conversations".☆94Apr 21, 2023Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Submission to the Affective Behavior Analysis in-the-wild (ABAW) 2020 competition.☆37Feb 15, 2023Updated 3 years ago
- "MULTIMODAL EMOTION RECOGNITION BASED ON DEEP TEMPORAL FEATURES USING CROSS-MODAL TRANSFORMER AND SELF-ATTENTION" ICASSP'23☆24Feb 26, 2023Updated 3 years ago
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆41Sep 1, 2023Updated 2 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Oct 31, 2022Updated 3 years ago
- The code for our proposed Task-driven Semantic Coding via Reinforcement Learning in TIP2021☆10Jan 24, 2023Updated 3 years ago
- ☆12Sep 25, 2023Updated 2 years ago
- [TOMM 2023] Emotion recognition methods through facial expression, speeches, audios, and multimodal data☆19Oct 25, 2023Updated 2 years ago
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆28Dec 4, 2024Updated last year
- Code for "Semantic Perturbations with Normalizing Flows for Improved Generalization"☆11Jul 13, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An online emotion recognition classifier using audio-visual modalities and deep reinforcement learning.☆10Jun 25, 2020Updated 5 years ago
- Code and Dataset for our CVPR 2022 paper "Video Shadow Detection via Spatio-Temporal Interpolation Consistency Training"☆12Jul 8, 2022Updated 3 years ago
- SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image Super-Resolution☆23Jan 30, 2026Updated 4 months ago
- ☆12Dec 22, 2022Updated 3 years ago
- ATTENTION AGGREGATION NETWORK FOR AUDIO-VISUAL EMOTION RECOGNITION☆13Sep 25, 2023Updated 2 years ago
- This is a [forked version] for author's debugging. Please jump to https://github.com/QualityAssessment/DOVER for stable version to use.☆14Oct 29, 2023Updated 2 years ago
- ABAW3 (CVPRW): A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition☆50Jan 15, 2024Updated 2 years ago
- Guided Interpretable Facial Expression Recognition via Spatial Action Unit Cues☆19Sep 16, 2024Updated last year
- ☆25Apr 16, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆26Apr 24, 2024Updated 2 years ago
- A image caption dataset about images from www.dpchallenge.com.☆20Dec 12, 2019Updated 6 years ago
- Multimodal preprocessing on IEMOCAP dataset☆13Jun 8, 2018Updated 8 years ago
- Source code for paper Multi-Task Learning for Depression Detection in Dialogs (SIGDial 2022)☆12Jan 18, 2025Updated last year
- Spiideo SoccerNet SynLoc - Single Frame World Coordinate Athlete Detection and Localization with Synthetic Data☆22Mar 27, 2026Updated 2 months ago
- ☆13Apr 27, 2023Updated 3 years ago
- An implementation for CVRP problem with A3C+Attention mechanism and GCN☆18May 17, 2020Updated 6 years ago
- A Large-scale, Multi-modal, Compound Affective Database for Dynamic Facial Expression Recognition in the Wild.☆64Dec 30, 2025Updated 5 months ago
- PyTorch implementation for Audio-Visual Domain Adaptation Feature Fusion for Speech Emotion Recognition☆12Mar 20, 2022Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This repository provides the ability to recoginize the emotion from video using audiovisual modalities。端到端的多模态情感识别代码☆11Mar 5, 2023Updated 3 years ago
- [IEEE T-BIOM] FaceXBench: Evaluating Multimodal LLMs on Face Understanding☆20Jan 15, 2026Updated 5 months ago
- The code for our IEEE ACCESS (2020) paper Multimodal Emotion Recognition with Transformer-Based Self Supervised Feature Fusion.☆123Sep 20, 2021Updated 4 years ago
- Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction (ICASSP 2022)☆14Jun 23, 2022Updated 3 years ago
- A Fully End2End Multimodal System for Fast Yet Effective Video Emotion Recognition☆39Aug 12, 2024Updated last year
- ☆11Nov 11, 2022Updated 3 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year