genzen2103 / Emotion-Detection-in-speech-using-Acoustic-and-Neural-FeaturesLinks
System for Emotion Detection in given speech data using joint modelling of hand crafted prosody rich features , MFCC features and LSTM based neural embedding
☆10Updated 8 years ago
Alternatives and similar repositories for Emotion-Detection-in-speech-using-Acoustic-and-Neural-Features
Users that are interested in Emotion-Detection-in-speech-using-Acoustic-and-Neural-Features are comparing it to the libraries listed below
Sorting:
- CTC for emotion recognition☆61Updated 8 years ago
- A program for automatic speaker identification using deep learning techniques.☆84Updated 8 years ago
- Bidirectional LSTM network for speech emotion recognition.☆267Updated 6 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 7 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆50Updated 8 years ago
- Detecting emotions using MFCC features of human speech using Deep Learning☆132Updated 5 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models☆39Updated last year
- Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching☆51Updated 7 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Updated 3 years ago
- A github repo of the openSMILE feature extraction tool.☆220Updated 4 years ago
- Implemented 3 neural network architectures: 1) Combination of RNN LSTM nodes and CNN, 2) CNN with residual blocks similar to ResNet, 3) D…☆25Updated 7 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆40Updated 7 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆72Updated 6 years ago
- openXBOW - the Passau Open-Source Crossmodal Bag-of-Words Toolkit☆83Updated 4 years ago
- End to End Dialect Identification using Convolutional Neural Network☆53Updated 6 years ago
- This is a project of speech emotion recognition using KERAS based Semi-Generative Adversarial Networks.☆11Updated 7 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆39Updated 8 years ago
- ☆40Updated 9 years ago
- Implementation of state of the art d-vector approach for speaker verification☆127Updated 8 years ago
- Bag-of-Features Acoustic Event Detection☆14Updated 9 years ago
- Baseline scripts of the 8th Audio/Visual Emotion Challenge (AVEC 2018)☆60Updated 7 years ago
- A Pytorch implementation of 'AUTOMATIC SPEECH EMOTION RECOGNITION USING RECURRENT NEURAL NETWORKS WITH LOCAL ATTENTION'☆41Updated 7 years ago
- Speaker identification with VGGVox network☆84Updated 7 years ago
- Deep Learning experiments for audio classification☆148Updated 8 years ago
- A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LST…☆54Updated 2 years ago
- Supporting code for "Emotion Recognition in Speech using Cross-Modal Transfer in the Wild"☆106Updated 6 years ago
- DCASE 2018 Baseline systems☆130Updated 6 years ago
- DCASE 2017 Baseline system☆82Updated 5 years ago
- implement end-to-end asr algorithm with tensorflow☆40Updated 7 years ago
- Tensorflow - Very Deep Convolutional Neural Networks For Raw Waveforms - https://arxiv.org/pdf/1610.00087.pdf☆75Updated 4 years ago