HappyColor / VesperLinks
A Compact and Effective Pretrained Model for Speech Emotion Recognition
☆47Updated last year
Alternatives and similar repositories for Vesper
Users that are interested in Vesper are comparing it to the libraries listed below
Sorting:
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆39Updated last year
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition☆25Updated last year
- ☆43Updated 2 years ago
- Official implement of SpeechFormer written in Python (PyTorch).☆82Updated 2 years ago
- SpeechFormer++ in PyTorch☆49Updated 2 years ago
- [ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…☆60Updated last year
- 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition.☆42Updated 4 years ago
- ☆109Updated 3 years ago
- ☆12Updated last year
- MSP-Podcast Challenge Baseline Code☆25Updated last year
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆27Updated 9 months ago
- EMO-SUPERB submission☆45Updated last year
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆152Updated 3 years ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆15Updated last year
- Official repository of NeXt-TDNN for speaker verification☆78Updated 11 months ago
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆37Updated 6 months ago
- ☆52Updated 4 years ago
- ☆30Updated 2 years ago
- ☆41Updated 4 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆136Updated 8 months ago
- ☆19Updated 2 years ago
- ☆17Updated 3 years ago
- ☆56Updated 2 years ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆22Updated 11 months ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated 2 years ago
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆63Updated last year
- ☆68Updated last year
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆22Updated last year
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆29Updated last year
- ☆13Updated last year