michen00 / unified_multilingual_dataset_of_emotional_human_utterancesLinks
A unified dataset of multilingual emotional human utterances
☆28Updated 3 years ago
Alternatives and similar repositories for unified_multilingual_dataset_of_emotional_human_utterances
Users that are interested in unified_multilingual_dataset_of_emotional_human_utterances are comparing it to the libraries listed below
Sorting:
- MSP-Podcast Challenge Baseline Code☆26Updated last year
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆48Updated last year
- ☆30Updated 3 years ago
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆28Updated 10 months ago
- ☆52Updated 4 years ago
- ☆69Updated last year
- ☆12Updated last year
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆137Updated 9 months ago
- ☆110Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Updated 3 years ago
- ☆45Updated 2 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆32Updated 3 years ago
- [ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…☆60Updated last year
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition☆25Updated last year
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆90Updated last year
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆43Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆22Updated 3 years ago
- ☆122Updated 3 years ago
- Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.☆16Updated 4 years ago
- How to use our public wav2vec2 age and gender model☆50Updated 2 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated 2 years ago
- ☆26Updated last year
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆39Updated last year
- ☆19Updated last year
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆27Updated 4 months ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆17Updated 2 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Updated 10 months ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆68Updated 3 years ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆90Updated 6 months ago