HappyColor / VesperLinks
A Compact and Effective Pretrained Model for Speech Emotion Recognition
☆40Updated 11 months ago
Alternatives and similar repositories for Vesper
Users that are interested in Vesper are comparing it to the libraries listed below
Sorting:
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition☆24Updated last year
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆37Updated last year
- MSP-Podcast Challenge Baseline Code☆24Updated last year
- ☆43Updated 2 years ago
- 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition.☆40Updated 4 years ago
- Official implement of SpeechFormer written in Python (PyTorch).☆80Updated 2 years ago
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆24Updated 6 months ago
- PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models (…☆60Updated 11 months ago
- ☆19Updated 2 years ago
- SpeechFormer++ in PyTorch☆48Updated last year
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆31Updated 3 months ago
- ☆12Updated last year
- This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…☆47Updated 3 years ago
- ☆109Updated 2 years ago
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆60Updated 11 months ago
- ☆41Updated 4 years ago
- EMO-SUPERB submission☆43Updated 9 months ago
- ☆51Updated 3 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆41Updated 2 years ago
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…☆21Updated 2 years ago
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆44Updated 3 years ago
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆27Updated 3 weeks ago
- ☆17Updated 3 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- ☆30Updated last year
- Transformer-based model for Speech Emotion Recognition(SER) - implemented by Pytorch☆38Updated last year
- Official repository of NeXt-TDNN for speaker verification☆72Updated 8 months ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆20Updated 10 months ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆19Updated 9 months ago
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…☆43Updated 2 years ago