znaoya / aenetLinks
AENet: audio feature extraction
☆60Updated 6 years ago
Alternatives and similar repositories for aenet
Users that are interested in aenet are comparing it to the libraries listed below
Sorting:
- Code to demonstrate multimodal LSTM☆36Updated 2 years ago
- ☆59Updated 7 years ago
- Stochastic Adaptive Neural Architecture Search☆65Updated 6 years ago
- TensorFlow implementation of "SoundNet".☆145Updated 7 years ago
- The source code for Temporal Attention-Gated Model.☆21Updated 8 years ago
- ☆70Updated 8 years ago
- Code for "Vid2speech: Speech Reconstruction from Silent Video" ICASSP '17☆116Updated 8 years ago
- The Video2GIF dataset with 100k GIFs from our paper at CVPR2016☆99Updated 8 years ago
- Author's implementation of the paper "Deep Relative Attributes" (ACCV 2016)☆43Updated 8 years ago
- Attention Bidirectional Video Recurrent Net☆56Updated 6 years ago
- Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018)☆51Updated 6 years ago
- ☆29Updated 8 years ago
- These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB☆50Updated 6 years ago
- Re-implementation of the Noise Contrastive Estimation algorithm for pyTorch, following "Noise-contrastive estimation: A new estimation pr…☆44Updated 6 years ago
- Mostly for using the trained weights from https://github.com/ryankiros/visual-semantic-embedding in Keras☆20Updated 9 years ago
- Signal Processing Library for PyTorch☆39Updated 8 years ago
- ZCA whitening in python☆32Updated 6 years ago
- ☆17Updated 7 years ago
- Various implementations and experimentation for deep neural network model compression☆24Updated 7 years ago
- Pytorch implementation of "Fast Training of Triplet-based Deep Binary Embedding Networks".☆40Updated 7 years ago
- Adversarial Discriminative Domain Adaptation with MNIST 64x64 in Lasagne-Theano☆32Updated 8 years ago
- Multimodal Compact Bilinear Pooling for Torch7☆69Updated 8 years ago
- ☆15Updated 8 years ago
- Supplementary material to "Top-down Visual Saliency Guided by Captions" (CVPR 2017)☆107Updated 7 years ago
- Video captioning using LSTM and CNN. This is the Visual Learning project done by Rui Zhang, Yujia Huang and Yu Zhang☆20Updated 9 years ago
- Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch☆65Updated 7 years ago
- Project Uncovering Temporal Context for Video Question and Answering☆14Updated 9 years ago
- A dataset with user created GIFs☆49Updated 7 years ago
- Auralisation of learned features in CNN (for audio)☆42Updated 8 years ago
- A pytorch implementation of the paper: "Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks"☆83Updated 7 years ago