znaoya / aenetLinks
AENet: audio feature extraction
☆60Updated 6 years ago
Alternatives and similar repositories for aenet
Users that are interested in aenet are comparing it to the libraries listed below
Sorting:
- Code to demonstrate multimodal LSTM☆36Updated 2 years ago
- ☆59Updated 8 years ago
- Author's implementation of the paper "Deep Relative Attributes" (ACCV 2016)☆43Updated 8 years ago
- ☆29Updated 8 years ago
- TensorFlow implementation of "SoundNet".☆145Updated 7 years ago
- The Video2GIF dataset with 100k GIFs from our paper at CVPR2016☆99Updated 8 years ago
- The source code for Temporal Attention-Gated Model.☆21Updated 8 years ago
- Stochastic Adaptive Neural Architecture Search☆65Updated 7 years ago
- Attention Bidirectional Video Recurrent Net☆56Updated 6 years ago
- Various implementations and experimentation for deep neural network model compression☆24Updated 7 years ago
- These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB☆50Updated 6 years ago
- Pytorch implementation of "Fast Training of Triplet-based Deep Binary Embedding Networks".☆40Updated 8 years ago
- Mostly for using the trained weights from https://github.com/ryankiros/visual-semantic-embedding in Keras☆20Updated 9 years ago
- ☆70Updated 8 years ago
- Signal Processing Library for PyTorch☆39Updated 8 years ago
- ☆17Updated 7 years ago
- Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018)☆51Updated 6 years ago
- Re-implementation of the Noise Contrastive Estimation algorithm for pyTorch, following "Noise-contrastive estimation: A new estimation pr…☆44Updated 6 years ago
- Torch implementation of the paper "Deep Pyramidal Residual Networks" (https://arxiv.org/abs/1610.02915).☆130Updated 8 years ago
- A two-stream convolutional neural network for learning abitrary similarity functions over two sets of training data☆24Updated 8 years ago
- Video captioning using LSTM and CNN. This is the Visual Learning project done by Rui Zhang, Yujia Huang and Yu Zhang☆20Updated 9 years ago
- A PyTorch implementation of the paper Mixup: Beyond Empirical Risk Minimization in PyTorch☆124Updated 7 years ago
- 4th place solution to Google Cloud & YouTube-8M Video Understanding Challenge☆26Updated 8 years ago
- Multimodal Compact Bilinear Pooling for Torch7☆69Updated 8 years ago
- 2016 ActivityNet action recognition challenge. CNN + LSTM approach. Multi-threaded loading.☆53Updated 9 years ago
- Code for "Predictive-Corrective Networks for Action Detection"☆16Updated 8 years ago
- Prunable nn layers for pytorch.☆48Updated 7 years ago
- An attempt to implement the recurrent attention model (RAM) from "Recurrent Models of Visual Attention" (Mnih+ 2014)☆43Updated 5 years ago
- A simplistic web app for annotating emotions in human speech video recordings.☆28Updated 11 years ago
- Image Captioning with Deep Bidirectional LSTMs☆84Updated last year