ilyabo / annemo
A simplistic web app for annotating emotions in human speech video recordings.
☆28Updated 10 years ago
Alternatives and similar repositories for annemo:
Users that are interested in annemo are comparing it to the libraries listed below
- Keras Implementation of "Look, Listen and Learn" Model☆21Updated 7 years ago
- Representations of language in a model of visually grounded speech signal.☆23Updated 7 years ago
- Code to demonstrate multimodal LSTM☆36Updated last year
- Live demo for speech emotion recognition using Keras and Tensorflow models☆39Updated 9 months ago
- Baseline scripts of the 8th Audio/Visual Emotion Challenge (AVEC 2018)☆58Updated 6 years ago
- ☆36Updated 8 years ago
- [CVPR 2019] Pytorch code for Audio Visual Scene-Aware Dialog☆34Updated 4 years ago
- Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch☆65Updated 6 years ago
- Tool for online Valence and Arousal annotation.☆35Updated 4 years ago
- ☆110Updated 2 years ago
- Code for the paper: Audio-Visual Model Distillation Using Acoustic Images☆21Updated 2 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Updated 2 years ago
- AENet: audio feature extraction☆60Updated 5 years ago
- Convolutional neural networks for sound classification☆20Updated 7 years ago
- Minimal implementation of Contrastive Predictive Coding for audio.☆16Updated 5 years ago
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 5 years ago
- Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification☆35Updated 6 years ago
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 7 years ago
- These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB☆50Updated 6 years ago
- M-VAD Names Dataset. Multimedia Tools and Applications (2019)☆20Updated 5 years ago
- Supporting code for "Emotion Recognition in Speech using Cross-Modal Transfer in the Wild"☆101Updated 5 years ago
- ☆12Updated 8 years ago
- Implementations of vanilla autoencoder, VAE, and GAN in Tensorflow☆17Updated 7 years ago
- ☆15Updated 7 years ago
- This is a project of speech emotion recognition using KERAS based Semi-Generative Adversarial Networks.☆11Updated 6 years ago
- LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".☆14Updated 4 years ago
- Author's implementation of the paper "Deep Relative Attributes" (ACCV 2016)☆43Updated 7 years ago
- End to End Multiview Lip Reading☆10Updated 7 years ago
- Multimodal sentiment analysis using hierarchical fusion with context modeling☆44Updated 2 years ago
- Unofficial Implementation of Google Deepmind's paper `Objects that Sound`☆83Updated 7 years ago