jdh-algo / MHAD-Dataset
Multimodal Home Activity Dataset with Multi-Angle Videos and Synchronized Physiological Signals
☆16 Updated last month
Alternatives and similar repositories for MHAD-Dataset:
Users who are interested in MHAD-Dataset are comparing it to the repositories listed below:
- ☆14 Updated 9 months ago
- [ECCV 2024🔥] The official code for the paper AUFormer: Vision Transformers are Parameter-Efficient Facial Action Unit Detectors. ☆48 Updated 6 months ago
- Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments, ☆55 Updated 2 months ago
- This repository provides the codes for MMA-DFER: multimodal (audiovisual) emotion recognition method. This is an official implementation … ☆25 Updated 4 months ago
- The official implementation of OpenSR (ACL2023 Oral) ☆15 Updated last year
- Code for the paper "Multi-scale Dynamic and Hierarchical Relationship Modeling for Facial Action Units Recognition" ☆22 Updated 7 months ago
- Implementation of the paper: StyleBERT: Text-Audio Sentiment Analysis with Bi-directional Style Enhancement ☆13 Updated last year
- [TAC 2024] SVFAP: Self-supervised Video Facial Affect Perceiver ☆12 Updated 4 months ago
- [CVPR 2023] Code for "Learning Emotion Representations from Verbal and Nonverbal Communication" ☆43 Updated last year
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor… ☆20 Updated last year
- [ACII Demo] Emolysis: A Multimodal Open-Source Group Emotion Analysis and Visualization Toolkit ☆11 Updated last month
- Audio-Visual Generalized Zero-Shot Learning using Large Pre-Trained Models ☆14 Updated 9 months ago
- Trustworthy Speech Emotion Recognition ☆13 Updated last year
- [BMVC'23] Prompting Visual-Language Models for Dynamic Facial Expression Recognition ☆117 Updated 2 months ago
- [ACM MM24] Official implementation of paper "From Speaker to Dubber: Movie Dubbing with Prosody and Duration Consistency Learning" ☆25 Updated 2 weeks ago
- ☆22 Updated 10 months ago
- ☆24 Updated 6 months ago
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition (ICASSP 2023 Oral) ☆56 Updated 6 months ago
- This is the official code repository of our dataset and ECCV 2024 paper entitled "Oulu Remote-photoplethysmography Physical Domain Attac… ☆10 Updated 4 months ago
- [CVPR 2024] EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning ☆23 Updated 4 months ago
- Download and preprocess voxceleb datasets. ☆24 Updated 8 months ago
- Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction (ICASSP 2022) ☆14 Updated 2 years ago
- QAFE-Net: Quality Assessment of Facial Expressions with Landmark Heatmaps ☆12 Updated 6 months ago
- ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition" ☆11 Updated 2 months ago
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition ☆20 Updated 9 months ago
- ☆9 Updated 6 months ago
- [ACL 2024] This is the Pytorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing" ☆72 Updated 2 months ago
- FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION ☆18 Updated last month
- Challenges in Video-Based Infant Action Recognition: A Critical Examination of the State of the Art (WACVW'24) ☆13 Updated last year
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB ima… ☆17 Updated 9 months ago