negrinho / deep_architect_legacy
DeepArchitect: Automatically Designing and Training Deep Architectures
☆144Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for deep_architect_legacy
- Efficient layer normalization GPU kernel for Tensorflow☆111Updated 7 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…☆148Updated 7 years ago
- Code for Attentive Recurrent Comparators☆57Updated 7 years ago
- Forward-mode Automatic Differentiation for TensorFlow☆140Updated 6 years ago
- Reference caffe implementation of LSUV initialization☆112Updated 7 years ago
- Cleaned original source code from my NIPS publication☆154Updated 6 years ago
- auto-tuning momentum SGD optimizer☆287Updated 5 years ago
- Tensorflow implementation of SGD with Coupled Adaptive Batch Size (CABS)☆43Updated 7 years ago
- Pytorch implementation of "Forward Thinking: Building and Training Neural Networks One Layer at a Time"☆65Updated 7 years ago
- Code for paper "L4: Practical loss-based stepsize adaptation for deep learning"☆124Updated 5 years ago
- Tensorflow Implementation on "The Cramer Distance as a Solution to Biased Wasserstein Gradients" (https://arxiv.org/pdf/1705.10743.pdf)☆125Updated 6 years ago
- ☆69Updated 5 years ago
- A pytorch implementation of "Self-Normalizing Neural Networks" by Klambauer et al. (still beta)☆59Updated 7 years ago
- Implementation of "Learning with Random Learning Rates" in PyTorch.☆102Updated 5 years ago
- Implementation of Appendix A (Neural Architecture Search with Reinforcement Learning: https://arxiv.org/abs/1611.01578) by chainer☆55Updated 6 years ago
- Implementation of http://arxiv.org/abs/1511.05641 that lets one build a larger net starting from a smaller one.☆160Updated 8 years ago
- ☆78Updated 6 years ago
- Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258)☆172Updated 8 years ago
- ☆69Updated 7 years ago
- Deep Unsupervised Perceptual Grouping☆131Updated 4 years ago
- A new kind of pooling layer for faster and sharper convergence☆76Updated 7 years ago
- The first public PyTorch implementation of Attentive Recurrent Comparators☆148Updated 7 years ago
- Weight initialization schemes for PyTorch nn.Modules☆70Updated 7 years ago
- Neural network training using iterated projections.☆89Updated 7 years ago
- Lasagne code for weight normalization☆87Updated 8 years ago
- DrMAD☆108Updated 7 years ago
- ☆53Updated 7 years ago
- Montréal Deep Learning Summer School 2016 material☆100Updated 8 years ago
- Pytorch implementation of DeepMind's differentiable neural computer paper.☆94Updated 6 years ago