jdh-algo / MHAD-Dataset
Multimodal Home Activity Dataset with Multi-Angle Videos and Synchronized Physiological Signals
☆20 · Updated 11 months ago
Alternatives and similar repositories for MHAD-Dataset
Users who are interested in MHAD-Dataset compare it to the repositories listed below.
- Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction (ICASSP 2022) ☆14 · Updated 3 years ago
- ☆23 · Updated last week
- Implementation of the paper: StyleBERT: Text-Audio Sentiment Analysis with Bi-directional Style Enhancement ☆14 · Updated 2 years ago
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition (ICASSP 2023 Oral) ☆65 · Updated last year
- Trustworthy Speech Emotion Recognition ☆13 · Updated 2 years ago
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor… ☆23 · Updated 2 years ago
- ☆19 · Updated last year
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition ☆25 · Updated last year
- Official implementation of "Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound". IEEE TASLP 20… ☆16 · Updated 2 months ago
- Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024 ☆47 · Updated last month
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023 ☆12 · Updated last year
- This is the official code repository of our dataset and ECCV 2024 paper entitled "Oulu Remote-photoplethysmography Physical Domain Attac… ☆13 · Updated 4 months ago
- SpeechFormer++ in PyTorch ☆49 · Updated 2 years ago
- ICSD Dataset ☆37 · Updated 5 months ago
- Official PyTorch implementation for "Zero-AVSR: Zero-Shot Audio-Visual Speech Recognition with LLMs by Learning Language-Agnostic Speech … ☆30 · Updated 6 months ago
- Download and preprocess voxceleb datasets. ☆39 · Updated 5 months ago
- ☆23 · Updated 4 months ago
- ☆24 · Updated last year
- [CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie… ☆22 · Updated 5 months ago
- DBPNet model ☆45 · Updated 11 months ago
- The official implementation of OpenSR (ACL2023 Oral) ☆16 · Updated 2 years ago
- Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments, ☆55 · Updated last month
- This repo is for measuring the heart rate and respiration rate using the webcam. Working on SpO2 oxygen level and will try for blood pres… ☆12 · Updated 3 years ago
- (NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class I… ☆19 · Updated 11 months ago
- [ACM MM24] Official implementation of paper "From Speaker to Dubber: Movie Dubbing with Prosody and Duration Consistency Learning" ☆31 · Updated 6 months ago
- Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement" ☆28 · Updated 2 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers" ☆13 · Updated 9 months ago
- [CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation ☆42 · Updated last year
- ☆15 · Updated 8 months ago
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations ☆39 · Updated last year