znaoya / aenet
AENet: audio feature extraction
☆60Updated 5 years ago
Alternatives and similar repositories for aenet:
Users that are interested in aenet are comparing it to the libraries listed below
- Code to demonstrate multimodal LSTM☆36Updated last year
- ☆17Updated 7 years ago
- The source code for Temporal Attention-Gated Model.☆21Updated 7 years ago
- Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018)☆49Updated 5 years ago
- Author's implementation of the paper "Deep Relative Attributes" (ACCV 2016)☆43Updated 7 years ago
- Data / annotations for video co-summarization (CVPR15)☆29Updated 8 years ago
- A PyTorch implementation for SoundNet☆22Updated 8 years ago
- TensorFlow implementation of "SoundNet".☆145Updated 7 years ago
- ☆59Updated 7 years ago
- ☆29Updated 8 years ago
- Unofficial Implementation of Google Deepmind's paper `Objects that Sound`☆83Updated 7 years ago
- Auralisation of learned features in CNN (for audio)☆42Updated 8 years ago
- Sound event detection in real life audio with CNN submitted to DCASE16☆22Updated 2 years ago
- An attempt to implement the recurrent attention model (RAM) from "Recurrent Models of Visual Attention" (Mnih+ 2014)☆43Updated 4 years ago
- Mostly for using the trained weights from https://github.com/ryankiros/visual-semantic-embedding in Keras☆20Updated 9 years ago
- Stochastic Adaptive Neural Architecture Search☆65Updated 6 years ago
- 2016 ActivityNet action recognition challenge. CNN + LSTM approach. Multi-threaded loading.☆53Updated 8 years ago
- Content-Based Video-Music Retrieval using Soft Intra-Modal Structure Constraint☆61Updated 7 years ago
- Code and demos for our paper at ACM MM 2017☆62Updated 6 years ago
- 4th place solution to Google Cloud & YouTube-8M Video Understanding Challenge☆26Updated 7 years ago
- A dataset with user created GIFs☆65Updated 6 years ago
- M-VAD Names Dataset. Multimedia Tools and Applications (2019)☆20Updated 5 years ago
- Signal Processing Library for PyTorch☆38Updated 7 years ago
- Code for "Predictive-Corrective Networks for Action Detection"☆16Updated 7 years ago
- A Pipline for extracting and processing features from videos☆34Updated 2 years ago
- This is a reimplementation of 3D CNN (http://vlg.cs.dartmouth.edu/c3d/). It is compatitable with Caffe 2016. The Caffe is forked from Caf…☆9Updated 8 years ago
- PyTorch implementation of Video Summarization on Twitch (LOL) dataset☆38Updated 6 years ago
- The code for shuttleNet.☆31Updated 7 years ago
- Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification☆35Updated 6 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017☆88Updated 7 years ago