genzen2103 / Emotion-Detection-in-speech-using-Acoustic-and-Neural-Features
System for Emotion Detection in given speech data using joint modelling of hand crafted prosody rich features , MFCC features and LSTM based neural embedding
☆10Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for Emotion-Detection-in-speech-using-Acoustic-and-Neural-Features
- CTC for emotion recognition☆60Updated 7 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆66Updated 2 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago
- Bag-of-Features Acoustic Event Detection☆14Updated 8 years ago
- openXBOW - the Passau Open-Source Crossmodal Bag-of-Words Toolkit☆81Updated 3 years ago
- Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching☆51Updated 6 years ago
- Implemented 3 neural network architectures: 1) Combination of RNN LSTM nodes and CNN, 2) CNN with residual blocks similar to ResNet, 3) D…☆25Updated 6 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models☆39Updated 3 months ago
- This is a project of speech emotion recognition using KERAS based Semi-Generative Adversarial Networks.☆11Updated 6 years ago
- ☆29Updated 6 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆37Updated 6 years ago
- ☆31Updated 7 years ago
- ☆27Updated 6 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆38Updated 6 years ago
- ☆12Updated 6 years ago
- A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LST…☆50Updated last year
- ☆40Updated 8 years ago
- Implementation of deep recurrent nonnegative matrix factorization (DR-NMF) for speech separation☆48Updated 5 years ago
- System for identifying speaker from given speech signal using MFCC,LPC features and Gaussian Mixture Models☆21Updated 7 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- Supporting code for "Emotion Recognition in Speech using Cross-Modal Transfer in the Wild"☆101Updated 5 years ago
- Baseline scripts of the 8th Audio/Visual Emotion Challenge (AVEC 2018)☆57Updated 6 years ago
- ☆108Updated 2 years ago
- Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification☆35Updated 6 years ago
- A set of speech feature extraction functions for ASR and speaker identification written in matlab.☆43Updated 8 years ago
- Framewise phoneme classification on the TIMIT dataset using neural networks☆19Updated 8 years ago
- ☆54Updated 5 years ago
- DCASE 2017 Baseline system☆82Updated 4 years ago