jdh-algo / MHAD-DatasetLinks
Multimodal Home Activity Dataset with Multi-Angle Videos and Synchronized Physiological Signals
☆18Updated 6 months ago
Alternatives and similar repositories for MHAD-Dataset
Users that are interested in MHAD-Dataset are comparing it to the libraries listed below
Sorting:
- ☆9Updated 11 months ago
- Trustworthy Speech Emotion Recognition☆13Updated 2 years ago
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆60Updated 11 months ago
- [ACM MM24] Official implementation of paper "From Speaker to Dubber: Movie Dubbing with Prosody and Duration Consistency Learning"☆27Updated last month
- Download and preprocess voxceleb datasets.☆31Updated last week
- ☆19Updated last month
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition☆24Updated last year
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB ima…☆18Updated last year
- Lightweight Video-based Respiration Rate Detection Algorithm☆23Updated last year
- RhythmFormer [Pattern Recognition]☆37Updated 3 months ago
- [ECCV 2024🔥] The official code for the paper AUFormer: Vision Transformers are Parameter-Efficient Facial Action Unit Detectors.☆52Updated 11 months ago
- Official source codes for the paper: EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing.☆18Updated 3 weeks ago
- ☆41Updated last year
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023☆12Updated last year
- This is the official code repository of our dataset and ECCV 2024 paper entitled "Oulu Remote-photoplethysmography Physical Domain Attac…☆12Updated last month
- Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction (ICASSP 2022)☆13Updated 3 years ago
- ☆36Updated last year
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Updated 2 years ago
- [CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie…☆20Updated 3 weeks ago
- [TPAMI & ECCV 2022] Contrast-Phys & Contrast-Phys+ for facial video-based remote physiological signal measurement☆83Updated last year
- SpeechFormer++ in PyTorch☆48Updated last year
- The source code and pre-trained models for Motion Matters: Neural Motion Transfer for Better Camera Physiological Sensing (WACV 2024, Ora…☆65Updated last year
- ☆12Updated last year
- rPPG; domain generalization; domain-label-free approach; NEuron STructure modeling (NEST);agnostic domain generalization.☆44Updated last year
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…☆21Updated 2 years ago
- [CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation☆37Updated 9 months ago
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆37Updated last year
- ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"☆12Updated 6 months ago
- FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological Sensing [NeurIPS 2024]☆19Updated 2 weeks ago
- Project to infere emotional expressions and benchmark datasets by Niklas Wagner, Felix Mätzler, Samed R. Vossberg, Helen Schneider and Sv…☆26Updated 3 months ago