3loi / MSP_Face
☆10Updated 3 months ago
Alternatives and similar repositories for MSP_Face:
Users that are interested in MSP_Face are comparing it to the libraries listed below
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆57Updated 7 months ago
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)☆22Updated 11 months ago
- PyTorch implementation for Audio-Visual Domain Adaptation Feature Fusion for Speech Emotion Recognition☆12Updated 2 years ago
- Multimodal Speech Recognition for phoneme level prediction using Audio-Visual data from TCDTIMIT dataset implementing RNNs with LSTMs for…☆13Updated last year
- ☆10Updated 2 years ago
- ☆104Updated 2 years ago
- Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments,☆56Updated 3 months ago
- Official Implementation of Visual Transformer Pooling for Lip reading☆39Updated 2 years ago
- ☆27Updated 2 years ago
- ☆46Updated 2 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆127Updated last month
- the implementation of chunk-level attention-based temporal aggregation framework for sequence-to-one recognition tasks☆8Updated 11 months ago
- Official implementation of RAVEn (ICLR 2023) and BRAVEn (ICASSP 2024)☆62Updated 7 months ago
- Tools for downloading VoxCeleb2 dataset☆28Updated 11 months ago
- Repository with the code of the paper: A proposal for Multimodal Emotion Recognition using auraltransformers and Action Units on RAVDESS …☆101Updated 10 months ago
- [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.☆47Updated last year
- A unified dataset of multilingual emotional human utterances☆24Updated 3 years ago
- Python implementation of the paper " Dynamic Temporal Alignment of Speech to Lips"☆31Updated 5 years ago
- Supporting code for "Emotion Recognition in Speech using Cross-Modal Transfer in the Wild"