znaoya / aenetLinks
AENet: audio feature extraction
☆60Updated 6 years ago
Alternatives and similar repositories for aenet
Users that are interested in aenet are comparing it to the libraries listed below
Sorting:
- Code to demonstrate multimodal LSTM☆36Updated 2 years ago
- Author's implementation of the paper "Deep Relative Attributes" (ACCV 2016)☆43Updated 8 years ago
- Stochastic Adaptive Neural Architecture Search☆65Updated 7 years ago
- ☆59Updated 8 years ago
- The source code for Temporal Attention-Gated Model.☆21Updated 8 years ago
- The Video2GIF dataset with 100k GIFs from our paper at CVPR2016☆100Updated 8 years ago
- Pytorch implementation of "Fast Training of Triplet-based Deep Binary Embedding Networks".☆40Updated 8 years ago
- Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018)☆51Updated 6 years ago
- Various implementations and experimentation for deep neural network model compression☆24Updated 7 years ago
- Attention Bidirectional Video Recurrent Net☆56Updated 6 years ago
- TensorFlow implementation of "SoundNet".☆145Updated 7 years ago
- Mostly for using the trained weights from https://github.com/ryankiros/visual-semantic-embedding in Keras☆20Updated 9 years ago
- Re-implementation of the Noise Contrastive Estimation algorithm for pyTorch, following "Noise-contrastive estimation: A new estimation pr…☆44Updated 6 years ago
- These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB☆50Updated 6 years ago
- ☆29Updated 8 years ago
- Code for "Vid2speech: Speech Reconstruction from Silent Video" ICASSP '17☆115Updated 8 years ago
- ZCA whitening in python☆33Updated 6 years ago
- ☆70Updated 8 years ago
- A PyTorch implementation of the paper Mixup: Beyond Empirical Risk Minimization in PyTorch☆124Updated 7 years ago
- Signal Processing Library for PyTorch☆39Updated 8 years ago
- 4th place solution to Google Cloud & YouTube-8M Video Understanding Challenge☆26Updated 8 years ago
- Representations of language in a model of visually grounded speech signal.☆23Updated 7 years ago
- A replication of Google DeepMind's paper End-to-End Sentence-level Lipreading☆28Updated 8 years ago
- ☆15Updated 8 years ago
- Pytorch implement WaveNet☆93Updated 7 years ago
- A PyTorch implementation of fast-wavenet☆92Updated 7 years ago
- Code for experiments with our RNN regularizer, which stochastically forces units to maintain previous values.☆78Updated 8 years ago
- ☆49Updated 2 years ago
- Video captioning using LSTM and CNN. This is the Visual Learning project done by Rui Zhang, Yujia Huang and Yu Zhang☆20Updated 9 years ago
- Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch☆65Updated 7 years ago