ilyabo / annemo
A simplistic web app for annotating emotions in human speech video recordings.
☆27Updated 9 years ago
Related projects: ⓘ
- Keras Implementation of "Look, Listen and Learn" Model☆21Updated 6 years ago
- Representations of language in a model of visually grounded speech signal.☆23Updated 6 years ago
- Code to demonstrate multimodal LSTM☆36Updated last year
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Updated 6 years ago
- Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch☆63Updated 6 years ago
- Convolutional neural networks for sound classification☆20Updated 6 years ago
- Generate vector embeddings for music☆18Updated 6 years ago
- Stochastic Adaptive Neural Architecture Search☆66Updated 5 years ago
- A MATLAB simulation of speech recognition based on pattern analysis, Mel Frequency Cepstral Coefficients as extracted feature and Dynamc …☆9Updated 9 years ago
- Random regression forests for audio event detection☆9Updated 7 years ago
- ☆18Updated 4 years ago
- Minimal implementation of Contrastive Predictive Coding for audio.☆16Updated 4 years ago
- These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB☆50Updated 5 years ago
- Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification☆35Updated 6 years ago
- An implementation of zoneout regularizer on LSTM-RNN by Tensorflow☆25Updated 7 years ago
- Unofficial Implementation of Google Deepmind's paper `Objects that Sound`☆83Updated 6 years ago
- Tool for online Valence and Arousal annotation.☆34Updated 3 years ago
- AENet: audio feature extraction☆60Updated 5 years ago
- ☆12Updated this week
- End to End Multiview Lip Reading☆10Updated 6 years ago
- ☆27Updated 5 years ago
- Auralisation of learned features in CNN (for audio)☆42Updated 7 years ago
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 4 years ago
- THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is c…☆32Updated 6 years ago
- LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".☆14Updated 3 years ago
- ☆69Updated last year
- Siamese network for unsupervised speech representation learning☆11Updated 5 years ago
- A PyTorch implementation for SoundNet☆22Updated 7 years ago
- Augmentation scripts for the bAbI Dialog Tasks dataset☆14Updated 5 years ago
- Egocentric Video Description based on Temporally-Linked Sequences☆11Updated 7 years ago