AuxFormer: Robust Approach to Audiovisual Emotion Recognition
☆14Mar 14, 2023Updated 2 years ago
Alternatives and similar repositories for AuxFormer
Users that are interested in AuxFormer are comparing it to the libraries listed below
Sorting:
- FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition☆33Nov 29, 2024Updated last year
- ☆20Oct 23, 2022Updated 3 years ago
- Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning☆17Aug 2, 2024Updated last year
- This repository provides implementation for the paper "Self-attention fusion for audiovisual emotion recognition with incomplete data".☆160Sep 16, 2024Updated last year
- This repository provides the ability to recoginize the emotion from video using audiovisual modalities。端到端的多模态情感识别代码☆11Mar 5, 2023Updated 3 years ago
- This is the code for Coupled-translation Fusion Network.☆11Dec 2, 2021Updated 4 years ago
- TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition," ICASSP-20☆33Aug 10, 2020Updated 5 years ago
- This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…☆50Apr 11, 2022Updated 3 years ago
- ☆27Oct 7, 2021Updated 4 years ago
- [AAAI 2023] AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work☆23Dec 7, 2025Updated 3 months ago
- [ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…☆60Jul 1, 2024Updated last year
- ☆29Mar 8, 2022Updated 4 years ago
- Official PyTorch implementation of Multilogue-Net (Best paper runner-up at Challenge-HML @ ACL 2020)☆58Dec 8, 2022Updated 3 years ago
- ☆28May 13, 2022Updated 3 years ago
- ☆28Nov 14, 2022Updated 3 years ago
- The code for our IEEE ACCESS (2020) paper Multimodal Emotion Recognition with Transformer-Based Self Supervised Feature Fusion.☆123Sep 20, 2021Updated 4 years ago
- Implementation of the paper "Real-Time Emotion Recognition via Attention Gated Hierarchical Memory Network" in AAAI-2020.☆31Sep 2, 2022Updated 3 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆140Jan 6, 2025Updated last year
- 缺陷检测是图像处理领域一个应用广泛的问题。本课题依托科研项目,采用无人机上的图像探测器采集工厂内部货架图片;通过图片配准及比对,识别螺丝松动等缺陷。从而防止隐患的发生。也可以使用公开数据集处理,课题主要是算法,不限制算法依托的软件平台。☆16Feb 20, 2024Updated 2 years ago
- NAACL 2022 paper on Analyzing Modality Robustness in Multimodal Sentiment Analysis☆31Jan 21, 2023Updated 3 years ago
- ☆43Jun 27, 2022Updated 3 years ago
- A demo for multi-modal emotion recognition.(多模态情感识别demo)☆92Apr 2, 2024Updated last year
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Feb 26, 2026Updated last week
- An online emotion recognition classifier using audio-visual modalities and deep reinforcement learning.☆10Jun 25, 2020Updated 5 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- [CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-…☆40Apr 20, 2025Updated 10 months ago
- IEEE T-BIOM : "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention"☆45Nov 29, 2024Updated last year
- PyTorch - Albert Large V2, Bert Base Uncased, Bert Large Uncased WWM Finetuned Squad, Distil Roberta Base, Roberta Base Squad2, Roberta l…☆11Jul 10, 2020Updated 5 years ago
- EmoCapCLIP: Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions☆20Jul 29, 2025Updated 7 months ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- ATTENTION AGGREGATION NETWORK FOR AUDIO-VISUAL EMOTION RECOGNITION☆13Sep 25, 2023Updated 2 years ago
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023☆12Mar 17, 2024Updated last year
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated 2 years ago
- ☆10Mar 24, 2023Updated 2 years ago
- A C++ implementation of stft, melspectrogram and mel_to_stft☆10Jun 2, 2022Updated 3 years ago
- ☆10Feb 13, 2025Updated last year
- ☆13May 21, 2024Updated last year
- Conversational Multimodal Emotion Recognition☆11Dec 7, 2020Updated 5 years ago
- Code for paper Audio Visual Speaker Localization from EgoCentric Views☆11Jul 3, 2024Updated last year