MixedEmotions / up_emotions_audio
This module extracts emotions from audio. The input is either an audio/video file uploaded to the server or a URL. The output is the predicted emotion, expressed as Arousal and Valence values in JSON-LD format.
☆21 Updated 6 years ago
Related projects:
- Documentation for the MixedEmotions Toolbox ☆46 Updated 6 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models ☆39 Updated last month
- A deep learning framework for Speech-Music discrimination of continuous audio streams ☆68 Updated 6 years ago
- Learning embeddings for laughter categorization ☆34 Updated 5 years ago
- A script for audio/transcript alignment. Fork of p2fa. ☆68 Updated 6 years ago
- ☆12 Updated this week
- Collaborative audio annotation tool ☆17 Updated 2 years ago
- Audio Analysis by Conceptor ☆30 Updated 9 years ago
- openXBOW - the Passau Open-Source Crossmodal Bag-of-Words Toolkit ☆81 Updated 3 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆61 Updated 4 years ago
- Speaker diarization via transfer learning ☆27 Updated 5 years ago
- ☆65 Updated 10 years ago
- Tool for creation, manipulation and maintenance of voice corpora ☆80 Updated 4 months ago
- Automatic prosodic annotation tool written in Java. ☆56 Updated 5 years ago
- An end-to-end MATLAB toolkit for completely unsupervised Speaker Diarization using state-of-the-art algorithms. ☆16 Updated 8 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications. ☆47 Updated 7 years ago
- ☆36 Updated 7 years ago
- Deep understanding and modelling of the hierarchical structure of prosody ☆22 Updated 5 years ago
- Tools for parsing the audio track in television news programs ☆19 Updated 3 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library. ☆14 Updated 4 years ago
- Sound augmentation using Large-scale audio dataset (Audioset) ☆44 Updated 3 years ago
- Audio features extraction tool from wav to h5features format ☆18 Updated 5 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility. ☆31 Updated last year
- ☆26 Updated 7 years ago
- SoundNet, built in Keras with pre-trained 8-layer model. ☆29 Updated 4 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition… ☆97 Updated 2 years ago
- Code for AccentDB. ☆20 Updated 3 years ago
- ☆33 Updated this week
- Correspondence and autoencoder neural network training for speech using Pylearn2. ☆13 Updated 8 years ago
- Theano implementation of Sequence-to-Sequence Autoencoder ☆13 Updated 6 years ago