thuhcsi / IJCAI2019-DRL4SER
The python implementation for paper "Towards Discriminative Representation Learning for Speech Emotion Recognition" in IJCAI-2019
☆23Updated 5 years ago
Alternatives and similar repositories for IJCAI2019-DRL4SER:
Users that are interested in IJCAI2019-DRL4SER are comparing it to the libraries listed below
- ☆48Updated 3 years ago
- ☆36Updated 2 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- Pre-training Cross-modal Transformer for Audio-and-Language Representations☆39Updated 3 years ago
- VoxSRC Challenge☆31Updated 5 years ago
- ☆53Updated 4 years ago
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆41Updated 3 years ago
- PyTorch implementation of a self-attentive speaker embedding☆17Updated 5 years ago
- Dataset and baseline for the first Audiocaption task☆79Updated 6 months ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- Contains code for Deep Self Supervised Heirarchical Clustering for Speaker Diarization☆17Updated 3 years ago
- experiments about AudioSet☆44Updated last year
- ☆25Updated 3 years ago
- The official repository for Audio ALBERT☆64Updated 3 years ago
- a deep accent recognition network☆48Updated 3 years ago
- PyTorch implementation of RPNSD☆60Updated 8 months ago
- ☆17Updated 4 years ago
- Accompany code to reproduce the baselines of the International Multimodal Sentiment Analysis Challenge (MuSe 2020).☆16Updated 2 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆42Updated 4 years ago
- ☆17Updated 5 years ago
- A Pytorch implementation of 'AUTOMATIC SPEECH EMOTION RECOGNITION USING RECURRENT NEURAL NETWORKS WITH LOCAL ATTENTION'☆41Updated 6 years ago
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation☆22Updated 3 years ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆16Updated last year
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Updated 5 years ago
- Seeing Wake Words: Audio-visual Keyword Spotting☆64Updated 4 years ago
- Build an attention-based model for speech recogntion.Use the Word2vec model to help to train the attention model.☆29Updated 5 years ago
- Pytorch code for the paper 'Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acousti…☆14Updated 4 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Updated 4 years ago
- ☆27Updated 2 years ago
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆25Updated 5 months ago