ETZET / SpeechEmotionAVLearningLinks
☆12Updated last year
Alternatives and similar repositories for SpeechEmotionAVLearning
Users that are interested in SpeechEmotionAVLearning are comparing it to the libraries listed below
Sorting:
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆48Updated last year
- ☆45Updated 2 years ago
- ☆19Updated 2 years ago
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆44Updated 3 years ago
- MSP-Podcast Challenge Baseline Code☆26Updated last year
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆28Updated 10 months ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated 2 years ago
- Official implementation of the INTERSPEECH 2024 paper: Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detect…☆48Updated 10 months ago
- This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".☆93Updated 3 years ago
- This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆104Updated 2 years ago
- ☆58Updated 2 years ago
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition☆25Updated last year
- ☆31Updated 2 years ago
- This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Spea…☆67Updated 2 years ago
- [ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…☆60Updated last year
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Updated 2 years ago
- ☆32Updated 11 months ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Updated 2 years ago
- ☆52Updated 4 years ago
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆38Updated 7 months ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Updated 10 months ago
- Official implement of SpeechFormer written in Python (PyTorch).☆80Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆68Updated 3 years ago
- ☆69Updated last year
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆27Updated 4 months ago
- 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition.☆43Updated 4 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 4 years ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆28Updated last year
- Python Implementation of Visual Relative Attributes for Image Classification and Zero Shot Learning☆22Updated 7 years ago