batikim09 / LIVE_SER
Live demo for speech emotion recognition using Keras and Tensorflow models
☆39Updated last year
Alternatives and similar repositories for LIVE_SER
Users that are interested in LIVE_SER are comparing it to the libraries listed below
- Work inspired by the Microsoft Research project on SER using ELM☆19Updated 7 years ago
- ☆40Updated 9 years ago
- Detecting emotions using MFCC features of human speech using Deep Learning☆132Updated 5 years ago
- CTC for emotion recognition☆61Updated 8 years ago
- openXBOW - the Passau Open-Source Crossmodal Bag-of-Words Toolkit☆83Updated 4 years ago
- A github repo of the openSMILE feature extraction tool.☆220Updated 4 years ago
- A program for automatic speaker identification using deep learning techniques.☆84Updated 8 years ago
- 24-hour Automatic Speech Recognition☆27Updated 4 years ago
- A deep learning framework for Speech-Music discrimination of continuous audio streams☆68Updated 7 years ago
- End to End Dialect Identification using Convolutional Neural Network☆53Updated 6 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 7 years ago
- Bidirectional LSTM network for speech emotion recognition.☆267Updated 6 years ago
- Supporting code for "Emotion Recognition in Speech using Cross-Modal Transfer in the Wild"☆106Updated 6 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 5 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Updated 5 years ago
- Repository of code for Speech emotion recognition using voiced speech and attention model, submitted to ICSigSys 2019☆13Updated 6 years ago
- Implementation of an end-to-end ASR algorithm with TensorFlow☆40Updated 7 years ago
- Deep understanding and modelling of the hierarchical structure of prosody☆24Updated 6 years ago
- Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09…☆61Updated 7 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 4 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Updated 6 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46Updated 8 years ago
- Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3☆107Updated last year
- A machine learning application for emotion recognition from speech☆136Updated 7 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.☆123Updated 5 years ago
- Cross-lingual Voice Conversion☆97Updated 7 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 8 years ago
- Classify the emotions from variable-length speech segments☆11Updated 7 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.☆36Updated 6 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆131Updated 4 years ago