loshchil/SGDR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/loshchil/SGDR)

loshchil / SGDR

☆257

Alternatives and similar repositories for SGDR

Users that are interested in SGDR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

gaohuang / SnapshotEnsemble
View on GitHub
Snapshot Ensembles in Torch (Snapshot Ensembles: Train 1, Get M for Free)
☆189May 16, 2017Updated 9 years ago
xgastaldi / shake-shake
View on GitHub
2.86% and 15.85% on CIFAR-10 and CIFAR-100
☆296Oct 9, 2018Updated 7 years ago
ryankiros / layer-norm
View on GitHub
Code and models from the paper "Layer Normalization"
☆243Nov 8, 2016Updated 9 years ago
yueatsprograms / Stochastic_Depth
View on GitHub
Deep Networks with Stochastic Depth
☆479Aug 13, 2018Updated 7 years ago
loshchil / AdamW-and-SGDW
View on GitHub
Decoupled Weight Decay Regularization (ICLR 2019)
☆300Jan 9, 2019Updated 7 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
szagoruyko / wide-residual-networks
View on GitHub
3.8% and 18.3% on CIFAR-10 and CIFAR-100
☆1,314Aug 20, 2019Updated 6 years ago
bigaidream-projects / drmad
View on GitHub
DrMAD
☆106Nov 12, 2017Updated 8 years ago
HIPS / hypergrad
View on GitHub
Exploring differentiation with respect to hyperparameters
☆297Jan 15, 2016Updated 10 years ago
iassael / torch-bnlstm
View on GitHub
Batch-Normalized LSTM (Recurrent Batch Normalization) implementation in Torch.
☆90May 22, 2016Updated 10 years ago
dlaptev / TI-pooling
View on GitHub
TI-pooling: transformation-invariant pooling for feature learning in Convolutional Neural Networks
☆118Aug 25, 2017Updated 8 years ago
CosmosShadow / DNI_Torch
View on GitHub
DNI(Decoupled Neural Interfaces using Synthetic Gradients) implementation with Torch
☆30Aug 30, 2016Updated 9 years ago
keskarnitish / large-batch-training
View on GitHub
Code to reproduce some of the figures in the paper "On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima"
☆147Apr 24, 2017Updated 9 years ago
pranv / lrh
View on GitHub
Learning RNN Hierarchies
☆45Jun 22, 2016Updated 10 years ago
jhkim89 / PyramidNet
View on GitHub
Torch implementation of the paper "Deep Pyramidal Residual Networks" (https://arxiv.org/abs/1610.02915).
☆131Oct 31, 2017Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
openai / weightnorm
View on GitHub
Example code for Weight Normalization, from "Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Netw…
☆364Nov 22, 2018Updated 7 years ago
kimiyoung / review_net
View on GitHub
Review Network for Caption Generation
☆181Jan 2, 2018Updated 8 years ago
paulbertens / rank-ordered-autoencoder
View on GitHub
Rank Ordered Autoencoder implementation as described in https://arxiv.org/abs/1605.01749
☆33May 9, 2016Updated 10 years ago
mrkulk / Unsupervised-Capsule-Network
View on GitHub
Capsule network with variations. Originally proposed by Tieleman & Hinton : http://www.cs.toronto.edu/~tijmen/tijmen_thesis.pdf
☆168Nov 1, 2017Updated 8 years ago
ruotianluo / Faster-RCNN-Densecap-torch
View on GitHub
Faster-RCNN based on Densecap(deprecated)
☆84Sep 12, 2016Updated 9 years ago
hardmaru / supercell
View on GitHub
supercell
☆192Oct 9, 2017Updated 8 years ago
xternalz / DelugeNets
View on GitHub
DelugeNets: Deep Networks with Efficient and Flexible Cross-layer Information Inflows
☆26Mar 20, 2017Updated 9 years ago
jiamings / fast-weights
View on GitHub
Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258)
☆173Nov 3, 2016Updated 9 years ago
Kaixhin / nninit
View on GitHub
Weight initialisation schemes for Torch7 neural network modules
☆100Jun 21, 2017Updated 9 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ducha-aiki / LSUVinit
View on GitHub
Reference caffe implementation of LSUV initialization
☆114Oct 31, 2017Updated 8 years ago
yangky11 / CNN-Color2Gray
View on GitHub
An implementation of Color2Gray with convolutional neural networks
☆11Dec 23, 2015Updated 10 years ago
coxlab / tsnet
View on GitHub
Tensor Switching Networks
☆12Nov 2, 2017Updated 8 years ago
albanD / adaptive-neural-compilation
View on GitHub
☆58May 26, 2016Updated 10 years ago
gidariss / AttractioNet
View on GitHub
Attend Refine Repeat: Active Box Proposal Generation via In-Out Localization
☆62Feb 12, 2019Updated 7 years ago
edouardoyallon / pyscatwave
View on GitHub
Fast Scattering Transform with CuPy/PyTorch
☆295Feb 22, 2020Updated 6 years ago
deltheil / vlfeat.torch
View on GitHub
VLFeat (partial) FFI wrapper for Torch7
☆12Mar 23, 2016Updated 10 years ago
dblN / stochastic_depth_keras
View on GitHub
Keras implementation for "Deep Networks with Stochastic Depth" http://arxiv.org/abs/1603.09382
☆139Jul 21, 2020Updated 6 years ago
endernewton / PixelNet
View on GitHub
☆29Oct 24, 2016Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
reedscot / nips2016
View on GitHub
Learning What and Where to Draw
☆335Nov 1, 2016Updated 9 years ago
diogo149 / theano_fractional_max_pooling
View on GitHub
Fractional Max Pooling implementation in Theano
☆21Sep 27, 2015Updated 10 years ago
gidariss / LocNet
View on GitHub
LocNet: Improving Localization Accuracy for Object Detection
☆179Oct 15, 2020Updated 5 years ago
kbullaughey / dni-synthetic-gradients
View on GitHub
Torch implementation reproducing MNIST experiments from DeepMind's DNI paper.
☆44Mar 4, 2017Updated 9 years ago
lnsmith54 / super-convergence
View on GitHub
Files to create the figures in the paper "Super-Convergence: Very Fast Training of Residual Networks Using Large Learning Rates"
☆192Dec 15, 2017Updated 8 years ago
nicholas-leonard / drmad
View on GitHub
Hyper-parameter Optimization with DrMAD and Hypero
☆23Jun 9, 2016Updated 10 years ago
facebookresearch / adaptive-softmax
View on GitHub
Implements an efficient softmax approximation as described in the paper "Efficient softmax approximation for GPUs" (http://arxiv.org/abs/…
☆395Mar 22, 2019Updated 7 years ago