znaoya / aenet
AENet: audio feature extraction
☆60Updated 5 years ago
Alternatives and similar repositories for aenet:
Users who are interested in aenet are comparing it to the libraries listed below
- Code to demonstrate multimodal LSTM☆36Updated last year
- Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018)☆49Updated 5 years ago
- ☆17Updated 6 years ago
- A PyTorch implementation for SoundNet☆22Updated 7 years ago
- Sound event detection in real life audio with CNN submitted to DCASE16☆22Updated 2 years ago
- The code for shuttleNet.☆31Updated 7 years ago
- Unofficial implementation of Google DeepMind's paper `Objects that Sound`☆83Updated 6 years ago
- ☆59Updated 7 years ago
- TensorFlow implementation of "SoundNet".☆145Updated 6 years ago
- The source code for Temporal Attention-Gated Model.☆21Updated 7 years ago
- Stochastic Adaptive Neural Architecture Search☆65Updated 6 years ago
- Mostly for using the trained weights from https://github.com/ryankiros/visual-semantic-embedding in Keras☆20Updated 8 years ago
- Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification☆35Updated 6 years ago
- CNN+RNN video classification☆9Updated 7 years ago
- Auralisation of learned features in CNN (for audio)☆42Updated 7 years ago
- Supplementary material to "Top-down Visual Saliency Guided by Captions" (CVPR 2017)☆107Updated 7 years ago
- Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch☆65Updated 6 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017☆88Updated 7 years ago
- Team NJU-LAMDA Code For ChaLearn LAP.☆19Updated 7 years ago
- ☆29Updated 7 years ago
- M-VAD Names Dataset. Multimedia Tools and Applications (2019)☆21Updated 5 years ago
- PyTorch implementation of sparse_image_warp, with an example of Google Brain's SpecAugment: A Simple Data Augmentation Method for A…☆23Updated 5 years ago
- Code for the paper: Audio-Visual Model Distillation Using Acoustic Images☆20Updated last year
- Data / annotations for video co-summarization (CVPR15)☆29Updated 8 years ago
- 2016 ActivityNet action recognition challenge. CNN + LSTM approach. Multi-threaded loading.☆53Updated 8 years ago
- Code and demos for our paper at ACM MM 2017☆63Updated 5 years ago
- Extract features from video file as the format in Youtube-8M☆14Updated 7 years ago
- A pipeline for extracting and processing features from videos☆34Updated 2 years ago
- Results of VoiceGAN voice transformation. The audio samples are in the folders A-AB-ABA/B-BA-BAB☆50Updated 5 years ago
- Multimodal Residual Learning for Visual QA (NIPS 2016)☆38Updated 8 years ago