MixedEmotions / up_emotions_audio
This module extracts emotions from audio. The input is either an audio/video file uploaded to the server or a URL. The output is the predicted emotion, expressed in terms of arousal and valence, in JSON-LD format.
☆21 Updated 6 years ago
Alternatives and similar repositories for up_emotions_audio:
Users who are interested in up_emotions_audio are comparing it to the libraries listed below.
- Documentation for the MixedEmotions Toolbox ☆46 Updated 6 years ago
- Speaker diarization via transfer learning ☆27 Updated 5 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models ☆39 Updated 5 months ago
- Tools for parsing the audio track in television news programs ☆19 Updated 3 years ago
- A deep learning framework for Speech-Music discrimination of continuous audio streams ☆68 Updated 6 years ago
- Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files (JASA 2016) ☆14 Updated 7 years ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit ☆16 Updated 8 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library. ☆15 Updated 4 years ago
- A recipe for creating a Speaker Identification system built on Kaldi. ☆15 Updated 5 years ago
- Dialect identification using Siamese network ☆15 Updated 7 years ago
- 24-hour Automatic Speech Recognition ☆27 Updated 3 years ago
- An end-to-end MATLAB toolkit for completely unsupervised Speaker Diarization using state-of-the-art algorithms. ☆16 Updated 9 years ago
- Automatic prosodic annotation tool written in Java. ☆58 Updated 5 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆63 Updated 4 years ago
- End to End Dialect Identification using Convolutional Neural Network ☆51 Updated 5 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset ☆35 Updated 6 years ago
- openXBOW - the Passau Open-Source Crossmodal Bag-of-Words Toolkit ☆81 Updated 3 years ago
- Learning embeddings for laughter categorization ☆34 Updated 6 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition… ☆97 Updated 2 years ago
- Code for the paper "Investigating the effect of residual and highway connections in speech enhancement models" ☆11 Updated 5 years ago
- Repository for subjective and objective evaluation of source separation algorithms ☆12 Updated 6 years ago
- Code for AccentDB. ☆19 Updated 3 years ago
- ☆65 Updated 11 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆72 Updated 2 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications. ☆47 Updated 7 years ago
- Unsupervised word segmentation and clustering of speech ☆13 Updated 7 years ago
- Collaborative audio annotation tool ☆17 Updated 2 years ago
- ABX and kaldi experiments on speech corpora made easy ☆31 Updated 3 months ago