MixedEmotions / up_emotions_audio
This module extracts emotions from audio. The input is either an audio/video file uploaded to the server or a URL. The output is the predicted emotion, expressed as Arousal and Valence values, in JSON-LD format.
☆21 Updated 6 years ago
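As a rough illustration of consuming such output, the sketch below reads Arousal and Valence values from a JSON-LD-style response. The sample document and its key names (`entries`, `emotions`, `arousal`, `valence`) and the helper `extract_arousal_valence` are hypothetical, chosen only for this example; the module's actual schema may differ.

```python
import json

# Hypothetical JSON-LD-style response. The key names here are assumptions
# made for illustration, not the module's documented schema.
SAMPLE_RESPONSE = """
{
  "@context": "http://example.org/context.jsonld",
  "entries": [
    {"emotions": [{"arousal": 0.62, "valence": -0.15}]}
  ]
}
"""


def extract_arousal_valence(response_text):
    """Return a list of (arousal, valence) tuples found in the response."""
    doc = json.loads(response_text)
    pairs = []
    for entry in doc.get("entries", []):
        for emotion in entry.get("emotions", []):
            pairs.append((emotion["arousal"], emotion["valence"]))
    return pairs


if __name__ == "__main__":
    for arousal, valence in extract_arousal_valence(SAMPLE_RESPONSE):
        print(f"arousal={arousal}, valence={valence}")
```

Keeping the parsing in one small function means only the key names need to change once the real response layout is known.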
Related projects
Alternatives and complementary repositories for up_emotions_audio
- Documentation for the MixedEmotions Toolbox ☆46 Updated 6 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models ☆39 Updated 3 months ago
- This package contains functions for converting wav files into auditory representations and comparing them ☆55 Updated 2 months ago
- An end-to-end MATLAB toolkit for completely unsupervised Speaker Diarization using state-of-the-art algorithms. ☆16 Updated 8 years ago
- openXBOW - the Passau Open-Source Crossmodal Bag-of-Words Toolkit ☆81 Updated 3 years ago
- Speaker diarization via transfer learning ☆27 Updated 5 years ago
- Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files (JASA 2016) ☆14 Updated 7 years ago
- A deep learning framework for Speech-Music discrimination of continuous audio streams ☆68 Updated 6 years ago
- 24-hour Automatic Speech Recognition ☆27 Updated 3 years ago
- Tools for parsing the audio track in television news programs ☆19 Updated 3 years ago
- Dialect identification using Siamese network ☆15 Updated 6 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library. ☆14 Updated 4 years ago
- audio features extraction tool from wav to h5features format ☆19 Updated 5 years ago
- A script for audio/transcript alignment. Fork of p2fa. ☆69 Updated 6 years ago
- ☆36 Updated 7 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise. ☆72 Updated 7 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition… ☆97 Updated 2 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆62 Updated 4 years ago
- ACLEW Diarization Virtual Machine ☆32 Updated 5 years ago
- Automatic prosodic annotation tool written in Java. ☆57 Updated 5 years ago
- ABX and kaldi experiments on speech corpora made easy ☆31 Updated last month
- Learning embeddings for laughter categorization ☆34 Updated 6 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility. ☆32 Updated last year
- A recipe for creating a Speaker Identification system built on Kaldi. ☆15 Updated 4 years ago
- Correspondence and autoencoder neural network training for speech using Pylearn2. ☆13 Updated 8 years ago
- Weakly Supervised CRNN System for Sound Event Detection With Large-scale Unlabeled In-domain Data ☆9 Updated 6 years ago
- End to End Dialect Identification using Convolutional Neural Network ☆51 Updated 5 years ago
- ESC: Dataset for Environmental Sound Classification - paper replication data ☆76 Updated 6 years ago