znaoya / aenet
AENet: audio feature extraction
☆60 Updated 5 years ago
Alternatives and similar repositories for aenet:
Users who are interested in aenet are comparing it to the libraries listed below.
- Code to demonstrate multimodal LSTM ☆36 Updated last year
- ☆17 Updated 7 years ago
- The source code for Temporal Attention-Gated Model. ☆21 Updated 7 years ago
- Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018) ☆49 Updated 5 years ago
- TensorFlow implementation of "SoundNet". ☆145 Updated 7 years ago
- Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch ☆65 Updated 6 years ago
- ☆59 Updated 7 years ago
- Mostly for using the trained weights from https://github.com/ryankiros/visual-semantic-embedding in Keras ☆20 Updated 8 years ago
- The code for shuttleNet. ☆31 Updated 7 years ago
- Data / annotations for video co-summarization (CVPR15) ☆29 Updated 8 years ago
- Author's implementation of the paper "Deep Relative Attributes" (ACCV 2016) ☆43 Updated 7 years ago
- Sound event detection in real life audio with CNN submitted to DCASE16 ☆22 Updated 2 years ago
- Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification ☆35 Updated 6 years ago
- A PyTorch implementation for SoundNet ☆22 Updated 8 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017 ☆88 Updated 7 years ago
- A more memory efficient Torch implementation of "Densely Connected Convolutional Networks". ☆29 Updated 7 years ago
- ☆29 Updated 8 years ago
- 2016 ActivityNet action recognition challenge. CNN + LSTM approach. Multi-threaded loading. ☆53 Updated 8 years ago
- Unofficial Implementation of Google Deepmind's paper `Objects that Sound` ☆83 Updated 6 years ago
- The Video2GIF dataset with 100k GIFs from our paper at CVPR2016 ☆100 Updated 7 years ago
- ☆71 Updated 8 years ago
- Stochastic Adaptive Neural Architecture Search ☆65 Updated 6 years ago
- This is my attempt at the ActivityNet Challenge 2017. Thanks to the organizers for providing the boilerplate code and annotated datasets.… ☆10 Updated 7 years ago
- "Recurrent Models of Visual Attention" in TensorFlow ☆41 Updated 8 years ago
- Multimodal Residual Learning for Visual QA (NIPS 2016) ☆38 Updated 8 years ago
- An attempt to implement the recurrent attention model (RAM) from "Recurrent Models of Visual Attention" (Mnih+ 2014) ☆43 Updated 4 years ago
- Code for "Predictive-Corrective Networks for Action Detection" ☆16 Updated 7 years ago
- Download Activity Net Videos ☆10 Updated 9 years ago
- This is a reimplementation of 3D CNN (http://vlg.cs.dartmouth.edu/c3d/). It is compatible with Caffe 2016. The Caffe is forked from Caf… ☆9 Updated 8 years ago
- These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB ☆50 Updated 6 years ago