FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition
☆34Nov 29, 2024Updated last year
Alternatives and similar repositories for Cross-Attentional-AV-Fusion
Users that are interested in Cross-Attentional-AV-Fusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- IEEE T-BIOM : "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention"☆47Nov 29, 2024Updated last year
- ABAW3 (CVPRW): A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition☆50Jan 15, 2024Updated 2 years ago
- ATTENTION AGGREGATION NETWORK FOR AUDIO-VISUAL EMOTION RECOGNITION☆13Sep 25, 2023Updated 2 years ago
- Source code for the paper "Unsupervised Video Summarization via Multi-source Features" published at ICMR 2021☆21Apr 5, 2022Updated 4 years ago
- ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"☆14Nov 29, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Submission to the Affective Behavior Analysis in-the-wild (ABAW) 2020 competition.☆37Feb 15, 2023Updated 3 years ago
- ☆12Aug 24, 2020Updated 5 years ago
- ☆27Oct 7, 2021Updated 4 years ago
- A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024)☆58Apr 17, 2024Updated 2 years ago
- ☆15Sep 24, 2021Updated 4 years ago
- Repository for my paper: Evaluation of Error and Correlation-Based Loss Functions For Multitask Learning Dimensional Speech Emotion Recog…☆20Mar 13, 2024Updated 2 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Oct 31, 2022Updated 3 years ago
- ABAW6 (CVPR-W) We achieved second place in the valence arousal challenge of ABAW6☆32May 21, 2024Updated 2 years ago
- Official implementation for SPA: A Graph Spectral Alignment Perspective for Domain Adaptation (NeurIPS 2023)☆18Dec 21, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A comprehensive collection of awesome research and other items about video domain adaptation☆114Jan 18, 2025Updated last year
- Deployed a facial emotion recognition using neural network model which predicts the emotion from faces in images, videos and live feed fr…☆11May 2, 2021Updated 5 years ago
- A PyTorch Implementation of AC-SUM-GAN from "AC-SUM-GAN: Connecting Actor-Critic and Generative Adversarial Networks for Unsupervised Vid…☆28May 4, 2022Updated 4 years ago
- multi-modal sentiment☆16Nov 19, 2024Updated last year
- Depression Recognition☆12Mar 11, 2024Updated 2 years ago
- CVPR'25 official code for O-TPT: Orthogonality Constraints for Calibrating Test-time Prompt Tuning in Vision-Language Models☆16Sep 19, 2025Updated 9 months ago
- ☆20Oct 23, 2022Updated 3 years ago
- Official code for paper "Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models, ICML2024"☆28Feb 2, 2025Updated last year
- PyTorch code for "M³T: Multi-Modal Multi-Task Learning for Continuous Valence-Arousal Estimation"☆26Feb 10, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆33Mar 10, 2023Updated 3 years ago
- [Information Fusion 2024] HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition☆121Aug 29, 2025Updated 9 months ago
- Code repository for the paper "Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction" @ ECCV 2024 (Oral)☆13Apr 22, 2025Updated last year
- experiments about AudioSet☆43Jul 22, 2023Updated 2 years ago
- [CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learnin…☆28Apr 10, 2023Updated 3 years ago
- [ICASSP 2025] Official PyTorch code for training and inference pipeline for DepMamba: Progressive Fusion Mamba for Multimodal Depression …☆106Mar 11, 2025Updated last year
- ☆23Oct 23, 2024Updated last year
- ☆20Jan 17, 2024Updated 2 years ago
- Code for paper "MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recogni…☆16Jun 21, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICCV2025] ModPrompt: Visual Modality Prompt for Adapting Vision-Language Object Detectors☆28Jul 10, 2025Updated 11 months ago
- ☆16Apr 4, 2022Updated 4 years ago
- Model calibration in CLIP Adapters☆20Aug 19, 2024Updated last year
- Official source code for the paper: "Reading Between the Frames Multi-Modal Non-Verbal Depression Detection in Videos"☆92Jun 2, 2026Updated 2 weeks ago
- Towards a general language-audio model for computational paralinguistic tasks☆30Dec 14, 2024Updated last year
- A C++ implementation of stft, melspectrogram and mel_to_stft☆11Jun 2, 2022Updated 4 years ago
- A Pytorch implementation of emotion recognition from videos☆18Sep 15, 2020Updated 5 years ago