jsuarez5341 / Recurrent-Highway-Hypernetworks-NIPS
Cleaned original source code from my NIPS publication
☆154Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for Recurrent-Highway-Hypernetworks-NIPS
- Code Samples from Neural Networks for NLP☆73Updated 7 years ago
- Pytorch implementation of "Forward Thinking: Building and Training Neural Networks One Layer at a Time"☆65Updated 7 years ago
- ☆78Updated 6 years ago
- Tools for PyTorch☆221Updated 2 years ago
- ☆64Updated 7 years ago
- auto-tuning momentum SGD optimizer☆287Updated 5 years ago
- ☆53Updated 7 years ago
- Weight initialization schemes for PyTorch nn.Modules☆70Updated 7 years ago
- ☆93Updated 7 years ago
- The first public PyTorch implementation of Attentive Recurrent Comparators☆148Updated 7 years ago
- Pytorch implementation of DeepMind's differentiable neural computer paper.☆94Updated 6 years ago
- Efficient layer normalization GPU kernel for Tensorflow☆111Updated 7 years ago
- personal notes☆56Updated 7 years ago
- Implementation of Appendix A (Neural Architecture Search with Reinforcement Learning: https://arxiv.org/abs/1611.01578) by chainer☆55Updated 6 years ago
- A pytorch implementation of "Self-Normalizing Neural Networks" by Klambauer et al. (still beta)☆59Updated 7 years ago
- Implementation of "Learning with Random Learning Rates" in PyTorch.☆102Updated 5 years ago
- Code and models from the paper "Layer Normalization"☆245Updated 8 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…☆148Updated 7 years ago
- A curated list of awesome hyperparameters for deep learning☆78Updated 7 years ago
- Pytorch implementation of bytenet from "Neural Machine Translation in Linear Time" paper☆47Updated 6 years ago
- DeepArchitect: Automatically Designing and Training Deep Architectures☆144Updated 5 years ago
- Tensorflow Implementation of PathNet: Evolution Channels Gradient Descent in Super Neural Networks☆102Updated 7 years ago
- Code for the Eager Translation Model from the paper You May Not Need Attention☆293Updated 5 years ago
- Implement Decoupled Neural Interfaces using Synthetic Gradients in Pytorch☆118Updated 7 years ago
- 🏃 Implementation of Using Fast Weights to Attend to the Recent Past.☆268Updated 5 years ago
- Dynamic evaluation for pytorch language models, now includes hyperparameter tuning☆105Updated 6 years ago
- Reference implementation for Structured Prediction with Deep Value Networks☆55Updated 7 years ago