MixedEmotions / up_emotions_audio
This module extracts emotions from audio. The input is either an audio/video file uploaded to the server or a URL. The output is the predicted emotion, expressed as Arousal and Valence values, in JSON-LD format.
☆22 · Updated 7 years ago
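To make the output description concrete, here is a minimal Python sketch of how an Arousal/Valence prediction could be wrapped in a JSON-LD document. It does not reproduce the module's actual schema: the `build_emotion_jsonld` helper, the `http://example.org/emotion#` vocabulary, the `ex:EmotionPrediction` type, and all field names are assumptions made for illustration only.

```python
import json


def build_emotion_jsonld(source, arousal, valence):
    """Wrap an arousal/valence prediction in a minimal JSON-LD document.

    All vocabulary terms below are hypothetical placeholders, not the
    terms used by up_emotions_audio itself.
    """
    return {
        "@context": {
            "ex": "http://example.org/emotion#",  # hypothetical vocabulary
            "source": "ex:source",
            "arousal": "ex:arousal",
            "valence": "ex:valence",
        },
        "@type": "ex:EmotionPrediction",
        "source": source,    # uploaded file name or URL that was analysed
        "arousal": arousal,  # predicted arousal value
        "valence": valence,  # predicted valence value
    }


if __name__ == "__main__":
    # Illustrative values only; they are not real predictions.
    doc = build_emotion_jsonld("http://example.org/clip.wav", 0.62, -0.15)
    print(json.dumps(doc, indent=2))
```

The `@context` block is what makes this JSON-LD instead of plain JSON: it maps each short key to a full vocabulary term, so a consumer can interpret `arousal` and `valence` unambiguously.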
Alternatives and similar repositories for up_emotions_audio
Users who are interested in up_emotions_audio are comparing it to the libraries listed below.
- Live demo for speech emotion recognition using Keras and Tensorflow models ☆39 · Updated 10 months ago
- Documentation for the MixedEmotions Toolbox ☆45 · Updated 7 years ago
- A deep learning framework for Speech-Music discrimination of continuous audio streams ☆68 · Updated 6 years ago
- Tools for parsing the audio track in television news programs ☆19 · Updated 4 years ago
- Inspired work by the project of SER using ELM at Microsoft Research ☆19 · Updated 6 years ago
- Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files (JASA 2016) ☆14 · Updated 8 years ago
- An end-to-end MATLAB toolkit for completely unsupervised Speaker Diarization using state-of-the-art algorithms. ☆16 · Updated 9 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset ☆35 · Updated 7 years ago
- Speaker diarization via transfer learning ☆27 · Updated 6 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications. ☆48 · Updated 8 years ago
- ☆36 · Updated 8 years ago
- Convolutional neural networks for sound classification ☆20 · Updated 7 years ago
- Dialect identification using Siamese network ☆15 · Updated 7 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆65 · Updated 4 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library. ☆15 · Updated 5 years ago
- openXBOW - the Passau Open-Source Crossmodal Bag-of-Words Toolkit ☆82 · Updated 4 years ago
- ACLEW Diarization Virtual Machine ☆32 · Updated 5 years ago
- Collaborative audio annotation tool ☆17 · Updated 2 years ago
- Util code, issues, discussions ☆29 · Updated 6 years ago
- Correspondence and autoencoder neural network training for speech using Pylearn2. ☆13 · Updated 9 years ago
- Deep Convolutional Networks on the Pitch Spiral for Musical Instrument Recognition ☆41 · Updated 8 years ago
- Sound augmentation using Large-scale audio dataset (Audioset) ☆45 · Updated 3 years ago
- implement end-to-end asr algorithm with tensorflow ☆40 · Updated 6 years ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit ☆16 · Updated 9 years ago
- End to End Dialect Identification using Convolutional Neural Network ☆52 · Updated 5 years ago
- A script for audio/transcript alignment. Fork of p2fa. ☆68 · Updated 7 years ago
- Learning embeddings for laughter categorization ☆34 · Updated 6 years ago
- 24-hour Automatic Speech Recognition ☆27 · Updated 4 years ago
- Deep understanding and modelling of the hierarchical structure of prosody ☆23 · Updated 6 years ago
- Unsupervised Speaker Clustering & Speaker Recognition ☆12 · Updated 6 years ago