auto-tuning momentum SGD optimizer
☆424Jan 9, 2018Updated 8 years ago
Alternatives and similar repositories for YellowFin
Users that are interested in YellowFin are comparing it to the libraries listed below.
- auto-tuning momentum SGD optimizer☆287Mar 24, 2019Updated 7 years ago
- Tensorflow implementation of DeepFM for CTR prediction.☆2,064Jun 10, 2018Updated 7 years ago
- Modified version of the YellowFin optimizer for TensorFlow to work with the Keras API [not actively maintained]☆16Jul 28, 2017Updated 8 years ago
- Example code for Weight Normalization, from "Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Netw…☆365Nov 22, 2018Updated 7 years ago
- Tutorials and implementations for "Self-normalizing networks"☆1,587May 12, 2026Updated last week
- Training Very Deep Neural Networks Without Skip-Connections☆589Jun 9, 2018Updated 7 years ago
- Compare SELUs (scaled exponential linear units) with other activations on MNIST, CIFAR10, etc.☆374Nov 1, 2017Updated 8 years ago
- Github repo for my experiments with the orthogonal convolution idea☆22Sep 20, 2017Updated 8 years ago
- 4th Place Solution for Mercari Price Suggestion Competition on Kaggle using DeepFM variant.☆285Jul 24, 2018Updated 7 years ago
- 📉 A collection of TensorBoard-related utilities (In Progress)☆37Nov 17, 2022Updated 3 years ago
- Code for paper "L4: Practical loss-based stepsize adaptation for deep learning"☆122Mar 27, 2019Updated 7 years ago
- TensorFlow Implementation of Neural Factorization Machine☆469Mar 1, 2020Updated 6 years ago
- Code for replicating results in 'On Weight Initializations in Deep Neural Networks'☆10Apr 28, 2017Updated 9 years ago
- Code and models from the paper "Layer Normalization"☆243Nov 8, 2016Updated 9 years ago
- Implementation of the SAN model for imparting gender privacy to face images☆63Jul 7, 2021Updated 4 years ago
- Tensor Switching Networks☆12Nov 2, 2017Updated 8 years ago
- MobileNet built with Tensorflow☆1,658Nov 6, 2017Updated 8 years ago
- SqueezeNet: AlexNet-level accuracy with 50x fewer parameters☆2,216Jul 9, 2018Updated 7 years ago
- Torch implementation of the paper "Deep Pyramidal Residual Networks" (https://arxiv.org/abs/1610.02915).☆130Oct 31, 2017Updated 8 years ago
- A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility☆6,290Aug 6, 2023Updated 2 years ago
- Lattice methods in TensorFlow☆522Jul 30, 2024Updated last year
- DrMAD☆107Nov 12, 2017Updated 8 years ago
- Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)☆2,109Jan 4, 2022Updated 4 years ago
- An optimizer that trains as fast as Adam and as good as SGD.☆2,905Jul 23, 2023Updated 2 years ago
- TensorFlow-based neural network library☆9,917May 6, 2026Updated 2 weeks ago
- in progress☆489Jun 2, 2019Updated 6 years ago
- PyTorch implementation of the Quasi-Recurrent Neural Network - up to 16 times faster than NVIDIA's cuDNN LSTM☆1,264Feb 12, 2022Updated 4 years ago
- TensorFlow implementation of the Value Iteration Networks (NIPS '16) paper☆551Mar 7, 2019Updated 7 years ago
- A general-purpose encoder-decoder framework for Tensorflow☆5,629Oct 15, 2020Updated 5 years ago
- Speed up PixelCNN++ image generation by up to 183 times☆479May 23, 2017Updated 9 years ago
- TensorFlow tutorials and best practices.☆8,591Oct 22, 2020Updated 5 years ago
- Interactive, node-by-node debugging and visualization for TensorFlow☆1,349Jan 27, 2017Updated 9 years ago
- Files to create the figures in the paper "Super-Convergence: Very Fast Training of Residual Networks Using Large Learning Rates"☆191Dec 15, 2017Updated 8 years ago
- Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)☆1,124Oct 13, 2017Updated 8 years ago
- Scalable, fast, and lightweight system for large-scale topic modeling☆844Dec 28, 2020Updated 5 years ago
- Summaries and notes on Deep Learning research papers☆4,421Feb 13, 2018Updated 8 years ago
- TensorFlow implementation of an arbitrary order Factorization Machine☆778Jan 17, 2022Updated 4 years ago
- A TensorFlow implementation of the Differentiable Neural Computer.☆2,540Jul 23, 2021Updated 4 years ago
- Improving Convolutional Networks via Attention Transfer (ICLR 2017)☆1,464Jul 11, 2018Updated 7 years ago