ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"
☆14Nov 29, 2024Updated last year
Alternatives and similar repositories for RecurrentJointAttentionwithLSTMs
Users that are interested in RecurrentJointAttentionwithLSTMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- IEEE T-BIOM : "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention"☆46Nov 29, 2024Updated last year
- ☆22Apr 22, 2024Updated last year
- FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition☆33Nov 29, 2024Updated last year
- ABAW6 (CVPR-W) We achieved second place in the valence arousal challenge of ABAW6☆31May 21, 2024Updated last year
- Source code for ICASSP 2022 paper "MM-DFN: Multimodal Dynamic Fusion Network For Emotion Recognition in Conversations".☆93Apr 21, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Submission to the Affective Behavior Analysis in-the-wild (ABAW) 2020 competition.☆37Feb 15, 2023Updated 3 years ago
- "MULTIMODAL EMOTION RECOGNITION BASED ON DEEP TEMPORAL FEATURES USING CROSS-MODAL TRANSFORMER AND SELF-ATTENTION" ICASSP'23☆23Feb 26, 2023Updated 3 years ago
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆41Sep 1, 2023Updated 2 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Oct 31, 2022Updated 3 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- The code for our proposed Task-driven Semantic Coding via Reinforcement Learning in TIP2021☆10Jan 24, 2023Updated 3 years ago
- We achieved the 2nd and 3rd places in ABAW3 and ABAW5, respectively.☆31Mar 7, 2024Updated 2 years ago
- [TOMM 2023] Emotion recognition methods through facial expression, speeches, audios, and multimodal data☆19Oct 25, 2023Updated 2 years ago
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆28Dec 4, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for "Semantic Perturbations with Normalizing Flows for Improved Generalization"☆11Jul 13, 2021Updated 4 years ago
- An online emotion recognition classifier using audio-visual modalities and deep reinforcement learning.☆10Jun 25, 2020Updated 5 years ago
- SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image Super-Resolution☆23Jan 30, 2026Updated 2 months ago
- ABAW3 (CVPRW): A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition☆50Jan 15, 2024Updated 2 years ago
- This is a [forked version] for author's debugging. Please jump to https://github.com/QualityAssessment/DOVER for stable version to use.☆14Oct 29, 2023Updated 2 years ago
- ATTENTION AGGREGATION NETWORK FOR AUDIO-VISUAL EMOTION RECOGNITION☆13Sep 25, 2023Updated 2 years ago
- ☆13Apr 12, 2022Updated 4 years ago
- Guided Interpretable Facial Expression Recognition via Spatial Action Unit Cues☆17Sep 16, 2024Updated last year
- ☆25Apr 16, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆11Nov 2, 2021Updated 4 years ago
- ☆26Apr 24, 2024Updated last year
- A image caption dataset about images from www.dpchallenge.com.☆20Dec 12, 2019Updated 6 years ago
- Here the code of EmoAudioNet is a deep neural network for speech classification (published in ICPR 2020)☆14Jul 13, 2020Updated 5 years ago
- Source code for paper Multi-Task Learning for Depression Detection in Dialogs (SIGDial 2022)☆12Jan 18, 2025Updated last year
- Tensorflow implementation of SICNet: a deep learning-based successive interference cancellation (SIC) receiver for non-orthogonal downlin…☆18Jul 25, 2022Updated 3 years ago
- Spiideo SoccerNet SynLoc - Single Frame World Coordinate Athlete Detection and Localization with Synthetic Data☆20Mar 27, 2026Updated 3 weeks ago
- This is the repository containing the codes for the paper "Scalable Deep Reinforcement Learning for Routing and Spectrum Access in Physic…☆15Feb 4, 2022Updated 4 years ago
- ☆13Apr 27, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Large-scale, Multi-modal, Compound Affective Database for Dynamic Facial Expression Recognition in the Wild.☆63Dec 30, 2025Updated 3 months ago
- PyTorch implementation for Audio-Visual Domain Adaptation Feature Fusion for Speech Emotion Recognition☆12Mar 20, 2022Updated 4 years ago
- This repository provides the ability to recoginize the emotion from video using audiovisual modalities。端到端的多模态情感识别代码☆11Mar 5, 2023Updated 3 years ago
- [IEEE T-BIOM] FaceXBench: Evaluating Multimodal LLMs on Face Understanding☆20Jan 15, 2026Updated 3 months ago
- Adaptive multi-layer perceptual attention network for facial expression recognition, TCSVT, 2022☆14Sep 11, 2022Updated 3 years ago
- The code for our IEEE ACCESS (2020) paper Multimodal Emotion Recognition with Transformer-Based Self Supervised Feature Fusion.☆123Sep 20, 2021Updated 4 years ago
- Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction (ICASSP 2022)☆14Jun 23, 2022Updated 3 years ago