callowbird / Harmonica
Code repository for the paper "Hyperparameter Optimization: A Spectral Approach" by Elad Hazan, Adam Klivans, Yang Yuan.
☆173Updated 6 years ago
Alternatives and similar repositories for Harmonica:
Users that are interested in Harmonica are comparing it to the libraries listed below
- Exploring differentiation with respect to hyperparameters☆295Updated 9 years ago
- auto-tuning momentum SGD optimizer☆287Updated 6 years ago
- Deep learning system course☆216Updated 6 years ago
- Reference caffe implementation of LSUV initialization☆113Updated 7 years ago
- DrMAD☆107Updated 7 years ago
- auto-tuning momentum SGD optimizer☆423Updated 7 years ago
- DeepArchitect: Automatically Designing and Training Deep Architectures☆145Updated 5 years ago
- https://2017.icml.cc/Conferences/2017/Schedule☆72Updated 7 years ago
- Hyperparameter optimization with approximate gradient☆66Updated 4 years ago
- Tensorflow implementation of SGD with Coupled Adaptive Batch Size (CABS)☆43Updated 8 years ago
- ☆251Updated 8 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…☆149Updated 7 years ago
- 2.86% and 15.85% on CIFAR-10 and CIFAR-100☆295Updated 6 years ago
- Fully differentiable deep-neural decision forest in tensorflow☆229Updated 7 years ago
- Implements pytorch code for the Accelerated SGD algorithm.☆215Updated 7 years ago
- An example of data parallelism and async updates of parameter in tensorflow.☆121Updated 6 years ago
- Standalone TensorBoard for visualizing in deep learning☆368Updated 5 years ago
- Optimizers for machine learning☆183Updated last year
- Code to reproduce some of the figures in the paper "On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima"☆139Updated 8 years ago
- Code for paper "L4: Practical loss-based stepsize adaptation for deep learning"☆125Updated 6 years ago
- A PyTorch implementation of a Factorization Machine module in cython.☆171Updated 7 years ago
- A new kind of pooling layer for faster and sharper convergence☆76Updated 7 years ago
- repo that holds code for improving on dropout using Stochastic Delta Rule☆142Updated 6 years ago
- Simple Tensorflow implementation of "On the Convergence of Adam and Beyond" (ICLR 2018)☆104Updated 6 years ago
- Videos of deep learning optimizers moving on 3D problem-landscapes☆107Updated 9 months ago
- Compare SELUs (scaled exponential linear units) with other activations on MNIST, CIFAR10, etc.☆375Updated 7 years ago
- Forward-mode Automatic Differentiation for TensorFlow☆139Updated 7 years ago
- Functional ANOVA☆123Updated last month
- Example code for Weight Normalization, from "Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Netw…☆365Updated 6 years ago
- MultiGPU enabled image generative models (GAN and DCGAN)☆206Updated 4 years ago