3loi / MSP_FaceLinks
☆12Updated 8 months ago
Alternatives and similar repositories for MSP_Face
Users that are interested in MSP_Face are comparing it to the libraries listed below
Sorting:
- Official Implementation of Visual Transformer Pooling for Lip reading☆40Updated 2 years ago
- Repository with the code of the paper: A proposal for Multimodal Emotion Recognition using auraltransformers and Action Units on RAVDESS …☆106Updated last year
- PyTorch implementation for Audio-Visual Domain Adaptation Feature Fusion for Speech Emotion Recognition☆12Updated 3 years ago
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)☆25Updated last year
- ☆10Updated 3 years ago
- ☆51Updated 2 years ago
- Tools for downloading VoxCeleb2 dataset☆30Updated last year
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆134Updated 6 months ago
- ☆109Updated 2 years ago
- [ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…☆175Updated last year
- ☆136Updated 10 months ago
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆60Updated last year
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆151Updated 3 years ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆160Updated 5 years ago
- Download and preprocess voxceleb datasets.☆31Updated 3 weeks ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Updated 4 years ago
- A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.☆86Updated 3 years ago
- Crowd Sourced Emotional Multimodal Actors Dataset (CREMA-D)☆448Updated 3 months ago
- Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments,☆56Updated 8 months ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆107Updated last year
- a PyTorch implementation of Lip2Wav☆51Updated 2 years ago
- Code and models for evaluating a state-of-the-art lip reading network☆195Updated 2 years ago
- processing and extracting of face and mouth image files out of the TCDTIMIT database☆45Updated 4 years ago
- A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.☆234Updated last year
- ☆45Updated 2 years ago
- A unified dataset of multilingual emotional human utterances☆26Updated 3 years ago
- SpeechFormer++ in PyTorch☆48Updated last year
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices☆67Updated last year
- 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition.☆40Updated 4 years ago
- Multimodal Speech Recognition for phoneme level prediction using Audio-Visual data from TCDTIMIT dataset implementing RNNs with LSTMs for…☆14Updated last year