samsad35 / VQ-MAE-S-codeLinks
A Vector Quantized Masked AutoEncoder for speech emotion recognition
☆22Updated last year
Alternatives and similar repositories for VQ-MAE-S-code
Users that are interested in VQ-MAE-S-code are comparing it to the libraries listed below
Sorting:
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆39Updated 11 months ago
- Official implement of SpeechFormer written in Python (PyTorch).☆80Updated 2 years ago
- SpeechFormer++ in PyTorch☆48Updated last year
- [IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer☆158Updated last month
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆60Updated 10 months ago
- ☆164Updated 10 months ago
- Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation☆14Updated 3 months ago
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition☆24Updated last year
- 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition.☆39Updated 4 years ago
- ☆18Updated last year
- EMO-SUPERB submission☆42Updated 9 months ago
- Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information☆145Updated last year
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆37Updated last year
- [SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model☆120Updated 7 months ago
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆30Updated 3 months ago
- FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION☆22Updated 5 months ago
- PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models (…☆59Updated 11 months ago
- Implementation of Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching (NeurIPS'24)☆40Updated 2 months ago
- ☆37Updated 11 months ago
- ☆13Updated 11 months ago
- ☆19Updated 2 years ago
- Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"☆143Updated 6 months ago
- ☆64Updated 2 weeks ago
- Pytorch implementation for “V2C: Visual Voice Cloning”☆32Updated 2 years ago
- Trustworthy Speech Emotion Recognition☆13Updated 2 years ago
- [ACL 2024] This is the Pytorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing"☆83Updated 6 months ago
- [Interspeech 2023] Intelligible Lip-to-Speech Synthesis with Speech Units☆39Updated 7 months ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆150Updated 3 years ago
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆24Updated 6 months ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆34Updated 3 years ago