znaoya / aenet
AENet: audio feature extraction
☆60 · Updated 5 years ago
Related projects:
- Code to demonstrate multimodal LSTM ☆36 · Updated last year
- Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018) ☆49 · Updated 4 years ago
- ☆17 · Updated 6 years ago
- A PyTorch implementation for SoundNet ☆22 · Updated 7 years ago
- The source code for Temporal Attention-Gated Model. ☆20 · Updated 7 years ago
- Unofficial Implementation of Google DeepMind's paper `Objects that Sound` ☆83 · Updated 6 years ago
- TensorFlow implementation of "SoundNet". ☆145 · Updated 6 years ago
- Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch ☆63 · Updated 6 years ago
- Sound event detection in real life audio with CNN submitted to DCASE16 ☆22 · Updated 2 years ago
- ☆60 · Updated 6 years ago
- Data / annotations for video co-summarization (CVPR15) ☆29 · Updated 7 years ago
- The code for shuttleNet. ☆31 · Updated 7 years ago
- Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification ☆35 · Updated 6 years ago
- Content-Based Video-Music Retrieval using Soft Intra-Modal Structure Constraint ☆61 · Updated 6 years ago
- ☆31 · Updated this week
- Stochastic Adaptive Neural Architecture Search ☆66 · Updated 5 years ago
- Keras Implementation of "Look, Listen and Learn" Model ☆21 · Updated 6 years ago
- My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf). ☆11 · Updated last year
- Mostly for using the trained weights from https://github.com/ryankiros/visual-semantic-embedding in Keras ☆20 · Updated 8 years ago
- 4th place solution to Google Cloud & YouTube-8M Video Understanding Challenge ☆26 · Updated 7 years ago
- ☆71 · Updated 7 years ago
- This is my attempt at the ActivityNet Challenge 2017. Thanks to the organizers for providing the boilerplate code and annotated datasets.… ☆9 · Updated 7 years ago
- PyTorch implementation of "Fast Training of Triplet-based Deep Binary Embedding Networks". ☆39 · Updated 6 years ago
- Semi-supervised deep learning by metric embedding ☆19 · Updated 7 years ago
- ☆29 · Updated 7 years ago
- Code for "Predictive-Corrective Networks for Action Detection" ☆16 · Updated 6 years ago
- WaveNet implementation with chainer ☆57 · Updated 7 years ago
- A pipeline for extracting and processing features from videos ☆34 · Updated 2 years ago
- Implementation of the Budgeted Super Networks ☆26 · Updated 5 years ago
- Auralisation of learned features in CNN (for audio) ☆42 · Updated 7 years ago