ilyabo / annemo
A simplistic web app for annotating emotions in human speech video recordings.
☆27 Updated 10 years ago
Related projects
Alternatives and complementary repositories for annemo
- Representations of language in a model of visually grounded speech signal. ☆23 Updated 6 years ago
- Keras implementation of the "Look, Listen and Learn" model ☆21 Updated 7 years ago
- Live demo for speech emotion recognition using Keras and TensorFlow models ☆39 Updated 3 months ago
- Code to demonstrate multimodal LSTM ☆36 Updated last year
- Convolutional neural networks for sound classification ☆20 Updated 6 years ago
- SoundNet, built in Keras with pre-trained 8-layer model. ☆29 Updated 5 years ago
- Baseline scripts of the 8th Audio/Visual Emotion Challenge (AVEC 2018) ☆57 Updated 6 years ago
- ☆15 Updated 6 years ago
- A speech emotion recognition project using Keras-based Semi-Generative Adversarial Networks. ☆11 Updated 6 years ago
- An implementation of the zoneout regularizer on LSTM-RNN in TensorFlow ☆25 Updated 7 years ago
- These are the results for VoiceGAN voice transformation. The audio samples are in folder A-AB-ABA/B-BA-BAB ☆50 Updated 5 years ago
- Generate vector embeddings for music ☆19 Updated 7 years ago
- Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch ☆63 Updated 6 years ago
- Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification ☆35 Updated 6 years ago
- A MATLAB simulation of speech recognition based on pattern analysis, Mel Frequency Cepstral Coefficients as extracted feature and Dynamic … ☆9 Updated 9 years ago
- ☆27 Updated 5 years ago
- Auralisation of learned features in CNN (for audio) ☆42 Updated 7 years ago
- ☆18 Updated 6 years ago
- Minimal implementation of Contrastive Predictive Coding for audio. ☆16 Updated 5 years ago
- ☆27 Updated 7 years ago
- Audio features extraction tool from wav to h5features format ☆19 Updated 5 years ago
- Siamese network for unsupervised speech representation learning ☆11 Updated 6 years ago
- Anonymous ICLR Submission ☆14 Updated 5 years ago
- Network specification and demo ☆35 Updated 7 years ago
- ☆18 Updated 5 years ago
- A replication of https://arxiv.org/pdf/1711.00937.pdf ☆16 Updated 7 years ago
- An implementation of the 'Watch, Listen, Attend and Spell' (WLAS) network that learns to transcribe videos of mouth motion to characters on p… ☆11 Updated 6 years ago
- This repository contains the code to reproduce the core results from the paper "Scalable Factorized Hierarchical Variational Autoencoders… ☆52 Updated 6 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility. ☆32 Updated last year
- Dialect identification using Siamese network ☆15 Updated 6 years ago