suicao / Pytorch-Audio-Emotion-Recognition
1st Place Public Leaderboard Solution for ERC2019
☆70Updated 5 years ago
Alternatives and similar repositories for Pytorch-Audio-Emotion-Recognition:
Users that are interested in Pytorch-Audio-Emotion-Recognition are comparing it to the libraries listed below
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Updated 4 years ago
- [ICASSP19] An Interaction-aware Attention Network for Speech Emotion Recognition in Spoken Dialogs☆35Updated 4 years ago
- Time series course Fall 2019 project☆54Updated 4 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆129Updated 2 months ago
- The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like'" Self Supervised Models to Improve Multimodal Speech Emotion R…☆120Updated 4 years ago
- The official repository for Audio ALBERT☆65Updated 3 years ago
- ☆106Updated 2 years ago
- ☆50Updated last year
- ☆27Updated 3 years ago
- Supporting code for "Emotion Recognition in Speech using Cross-Modal Transfer in the Wild"☆101Updated 5 years ago
- This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data…☆132Updated 3 years ago
- The python implementation for paper "Towards Discriminative Representation Learning for Speech Emotion Recognition" in IJCAI-2019☆23Updated 5 years ago
- TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition," ICASSP-20☆32Updated 4 years ago
- Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)☆73Updated 4 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 3 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆148Updated 3 years ago
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆64Updated 3 years ago
- Accompany code to reproduce the baselines of the International Multimodal Sentiment Analysis Challenge (MuSe 2020).☆16Updated 2 years ago
- A multimodal approach on emotion recognition using audio and text.☆171Updated 4 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆89Updated 3 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- Augmentation adversarial training for self-supervised speaker recognition☆77Updated 3 years ago
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆144Updated 2 years ago
- Finetune Wa2vec 2.0 For Speech Recognition☆125Updated last month
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆261Updated 2 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18☆277Updated 9 months ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆316Updated 5 months ago