thuhcsi / IJCAI2019-DRL4SERView external linksLinks
The python implementation for paper "Towards Discriminative Representation Learning for Speech Emotion Recognition" in IJCAI-2019
☆23Aug 12, 2019Updated 6 years ago
Alternatives and similar repositories for IJCAI2019-DRL4SER
Users that are interested in IJCAI2019-DRL4SER are comparing it to the libraries listed below
Sorting:
- ☆17Feb 14, 2020Updated 6 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- ☆12Oct 2, 2020Updated 5 years ago
- Implementation of the paper "Emotion Identification from raw speech signals using DNNs"☆14Jun 11, 2020Updated 5 years ago
- ☆28May 13, 2022Updated 3 years ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Dec 20, 2020Updated 5 years ago
- CycleGAN-based Emotion Style Transfer as Data Augmentation for Speech Emotion Recognition☆12Oct 7, 2019Updated 6 years ago
- [ICASSP19] An Interaction-aware Attention Network for Speech Emotion Recognition in Spoken Dialogs☆35May 17, 2020Updated 5 years ago
- Accompany code to reproduce the baselines of the International Multimodal Sentiment Analysis Challenge (MuSe 2020).☆16Dec 8, 2022Updated 3 years ago
- Event Relation in Text-to-Audio (TTA) Generation☆20Feb 26, 2025Updated 11 months ago
- Learning Complex Basis Functions for Invariant Signal Representations with the Complex Autoencoder☆38Dec 16, 2024Updated last year
- ☆20Nov 22, 2020Updated 5 years ago
- Download and create a tfreader for the audioset dataset☆16Apr 16, 2020Updated 5 years ago
- DeepCU: Integrating Both Common and Unique Latent Information for Multimodal Sentiment Analysis, IJCAI-19☆19Nov 21, 2019Updated 6 years ago
- ☆18Aug 9, 2018Updated 7 years ago
- Official code of "IRNet: Iterative Refinement Network for Noisy Partial Label Learning"☆21Oct 8, 2025Updated 4 months ago
- CCA, DCCA, DCCAE, ConvCCA☆21Dec 16, 2020Updated 5 years ago
- Repository for "Improving evidential deep learning via multi-task learning," published in AAAI2022☆20Mar 4, 2022Updated 3 years ago
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆50Dec 17, 2024Updated last year
- Official implementation of the Seq-U-Net for efficient sequence modelling☆80Jul 25, 2024Updated last year
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Dec 12, 2019Updated 6 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Jul 4, 2018Updated 7 years ago
- Urdu Language Speech Emotional Corpus☆46Jan 17, 2019Updated 7 years ago
- PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…☆26Jan 11, 2022Updated 4 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆149Jul 13, 2023Updated 2 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- Repository containg experiments with Extreme Learning Machines And Reservoir Computing, ELMARC.☆20May 1, 2018Updated 7 years ago
- MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation☆27Mar 4, 2025Updated 11 months ago
- Official code of "ALIM: Adjusting Label Importance Mechanism for Noisy Partial Label Learning"☆24Sep 25, 2023Updated 2 years ago
- [AAAI 2023] AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work☆23Dec 7, 2025Updated 2 months ago
- Jupyter notebook for DCASE 2020 challenge Task 1☆20Jun 24, 2020Updated 5 years ago
- ☆64Sep 26, 2022Updated 3 years ago
- 2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning…☆24Aug 3, 2023Updated 2 years ago
- Baseline scripts of the 8th Audio/Visual Emotion Challenge (AVEC 2018)☆60Jul 4, 2018Updated 7 years ago
- Acoustic Scene Classification Using Deep Residual Networks with Late Fusion of Separated High and Low Frequency Paths - McDonnell and Gao…☆22Jul 3, 2024Updated last year
- ☆27Apr 29, 2025Updated 9 months ago
- ☆29Mar 8, 2022Updated 3 years ago
- ☆108Aug 24, 2022Updated 3 years ago
- This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…☆27Mar 18, 2021Updated 4 years ago