jsuarez5341 / Recurrent-Highway-Hypernetworks-NIPS
Cleaned original source code from my NIPS publication
☆154Updated 7 years ago
Alternatives and similar repositories for Recurrent-Highway-Hypernetworks-NIPS:
Users that are interested in Recurrent-Highway-Hypernetworks-NIPS are comparing it to the libraries listed below
- Code Samples from Neural Networks for NLP☆72Updated 7 years ago
- Pytorch implementation of "Forward Thinking: Building and Training Neural Networks One Layer at a Time"☆65Updated 7 years ago
- ☆92Updated 7 years ago
- ☆79Updated 7 years ago
- A pytorch implementation of "Self-Normalizing Neural Networks" by Klambauer et al. (still beta)☆59Updated 7 years ago
- ☆63Updated 7 years ago
- Pytorch implementation of DeepMind's differentiable neural computer paper.☆93Updated 7 years ago
- Tools for PyTorch☆221Updated 2 years ago
- Reference implementation for Structured Prediction with Deep Value Networks☆54Updated 7 years ago
- ☆53Updated 7 years ago
- Weight initialization schemes for PyTorch nn.Modules☆70Updated 8 years ago
- DeepArchitect: Automatically Designing and Training Deep Architectures☆144Updated 5 years ago
- personal notes☆55Updated 7 years ago
- Pytorch implementation of bytenet from "Neural Machine Translation in Linear Time" paper☆46Updated 7 years ago
- Code and models from the paper "Layer Normalization"☆244Updated 8 years ago
- auto-tuning momentum SGD optimizer☆286Updated 5 years ago
- Scrapes the abstracts to NIPS 2017 papers.☆40Updated 7 years ago
- A bare-bones NumPy implementation of "Multimodal Neural Language Models" (Kiros et al, ICML 2014)☆54Updated 8 years ago
- Implementation of Appendix A (Neural Architecture Search with Reinforcement Learning: https://arxiv.org/abs/1611.01578) by chainer☆54Updated 6 years ago
- The first public PyTorch implementation of Attentive Recurrent Comparators☆147Updated 7 years ago
- Implementation of "Learning with Random Learning Rates" in PyTorch.☆102Updated 5 years ago
- Code for Emergent Translation in Multi-Agent Communication☆80Updated 6 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…☆148Updated 7 years ago
- Efficient layer normalization GPU kernel for Tensorflow☆110Updated 7 years ago
- Tensorflow Implementation on "The Cramer Distance as a Solution to Biased Wasserstein Gradients" (https://arxiv.org/pdf/1705.10743.pdf)☆125Updated 7 years ago
- A PyTorch implementation of Recurrent Additive Networks by Lee et al. (2017)☆29Updated 7 years ago
- ☆44Updated 7 years ago
- Code for Attentive Recurrent Comparators☆57Updated 8 years ago
- Implement Decoupled Neural Interfaces using Synthetic Gradients in Pytorch☆119Updated 7 years ago
- Code for experiments with our RNN regularizer, which stochastically forces units to maintain previous values.☆78Updated 7 years ago