ilucasgoncalves / AuxFormerView external linksLinks
AuxFormer: Robust Approach to Audiovisual Emotion Recognition
☆14Mar 14, 2023Updated 2 years ago
Alternatives and similar repositories for AuxFormer
Users that are interested in AuxFormer are comparing it to the libraries listed below
Sorting:
- FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition☆33Nov 29, 2024Updated last year
- Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning☆17Aug 2, 2024Updated last year
- ☆20Oct 23, 2022Updated 3 years ago
- This repository provides the ability to recoginize the emotion from video using audiovisual modalities。端到端的多模态情感识别代码☆11Mar 5, 2023Updated 2 years ago
- TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition," ICASSP-20☆33Aug 10, 2020Updated 5 years ago
- A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024)☆58Apr 17, 2024Updated last year
- ☆27Oct 7, 2021Updated 4 years ago
- ☆26May 8, 2022Updated 3 years ago
- [ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…☆60Jul 1, 2024Updated last year
- ☆29Mar 8, 2022Updated 3 years ago
- Official PyTorch implementation of Multilogue-Net (Best paper runner-up at Challenge-HML @ ACL 2020)☆58Dec 8, 2022Updated 3 years ago
- ☆28Nov 14, 2022Updated 3 years ago
- ☆28May 13, 2022Updated 3 years ago
- The code for our IEEE ACCESS (2020) paper Multimodal Emotion Recognition with Transformer-Based Self Supervised Feature Fusion.☆123Sep 20, 2021Updated 4 years ago
- Implementation of the paper "Real-Time Emotion Recognition via Attention Gated Hierarchical Memory Network" in AAAI-2020.☆31Sep 2, 2022Updated 3 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆140Jan 6, 2025Updated last year
- experiments about AudioSet☆43Jul 22, 2023Updated 2 years ago
- This repository provides the codes for MMA-DFER: multimodal (audiovisual) emotion recognition method. This is an official implementation …☆50Sep 16, 2024Updated last year
- NAACL 2022 paper on Analyzing Modality Robustness in Multimodal Sentiment Analysis☆31Jan 21, 2023Updated 3 years ago
- ☆43Jun 27, 2022Updated 3 years ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Feb 5, 2026Updated last week
- A demo for multi-modal emotion recognition.(多模态情感识别demo)☆91Apr 2, 2024Updated last year
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- An online emotion recognition classifier using audio-visual modalities and deep reinforcement learning.☆10Jun 25, 2020Updated 5 years ago
- [CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-…☆40Apr 20, 2025Updated 9 months ago
- IEEE T-BIOM : "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention"☆45Nov 29, 2024Updated last year
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023☆12Mar 17, 2024Updated last year
- ☆13Aug 14, 2022Updated 3 years ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- ATTENTION AGGREGATION NETWORK FOR AUDIO-VISUAL EMOTION RECOGNITION☆13Sep 25, 2023Updated 2 years ago
- ☆13May 21, 2024Updated last year
- A C++ implementation of stft, melspectrogram and mel_to_stft☆10Jun 2, 2022Updated 3 years ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated last year
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting☆17Aug 26, 2025Updated 5 months ago
- Conversational Multimodal Emotion Recognition☆11Dec 7, 2020Updated 5 years ago
- Official Code for "A Likelihood Ratio-Based Approach to Segmenting Unknown Objects" [IJCV 2025]☆15Jun 9, 2025Updated 8 months ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- Code for paper Audio Visual Speaker Localization from EgoCentric Views☆11Jul 3, 2024Updated last year
- Code for "ATTA: Anomaly-aware Test-Time Adaptation for Out-of-Distribution Detection in Segmentation" (NeurIPS 23)☆14Apr 12, 2024Updated last year