genzen2103 / Emotion-Detection-in-speech-using-Acoustic-and-Neural-Features
System for Emotion Detection in given speech data using joint modelling of hand crafted prosody rich features , MFCC features and LSTM based neural embedding
☆10Updated 6 years ago
Related projects: ⓘ
- CTC for emotion recognition☆60Updated 7 years ago
- This is a project of speech emotion recognition using KERAS based Semi-Generative Adversarial Networks.☆11Updated 6 years ago
- Bag-of-Features Acoustic Event Detection☆14Updated 7 years ago
- Implemented 3 neural network architectures: 1) Combination of RNN LSTM nodes and CNN, 2) CNN with residual blocks similar to ResNet, 3) D…☆25Updated 6 years ago
- ☆63Updated this week
- ☆27Updated 6 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆66Updated last year
- Google Speech Command Dataset Classification Neural Network, CNN, RNN☆24Updated 7 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆38Updated 6 years ago
- Phone generation model/VAE/GAN/VAE+GAN☆19Updated 6 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago
- System for identifying speaker from given speech signal using MFCC,LPC features and Gaussian Mixture Models☆21Updated 6 years ago
- "Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015☆20Updated 7 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models☆39Updated last month
- ☆12Updated 6 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆37Updated 6 years ago
- A speaker recognition system which uses GMM-UBM for use in an Android application which helps in monitoring patients suffering from Schiz…☆52Updated 6 years ago
- Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching☆50Updated 6 years ago
- Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification☆35Updated 6 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Updated 6 years ago
- implement end-to-end asr algorithm with tensorflow☆40Updated 6 years ago
- ☆29Updated 5 years ago
- 2018年7⽉30⽇-8⽉13⽇持续2周的好未来AI训练营中语⾳情感识别营的项目报告☆32Updated 5 years ago
- openXBOW - the Passau Open-Source Crossmodal Bag-of-Words Toolkit☆81Updated 3 years ago
- Implementation of deep recurrent nonnegative matrix factorization (DR-NMF) for speech separation☆47Updated 5 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆47Updated 7 years ago
- Training General-Purpose Audio Tagging Networks with Noisy Labels and Iterative Self-Verification☆29Updated 5 years ago
- Urban Sound Classification: With Random Forest, SVM, DNN, RNN, and CNN Classifiers☆51Updated 7 years ago
- A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LST…☆47Updated last year
- Emotion recognition of Speaker's Speech Data. Employ speaker detection classifiers for emotion recognition, a multiclass classification p…☆16Updated 9 years ago