dlsys-course / assignment2-2017
(Spring 2017) Assignment 2: GPU Executor
☆62 · Updated 7 years ago
Alternatives and similar repositories for assignment2-2017:
Users who are interested in assignment2-2017 are comparing it to the repositories listed below:
- Just-in-time Dynamic Batching with MXNet Gluon. ☆52 · Updated 4 years ago
- Implementing (parts of) TensorFlow (almost) from Scratch ☆30 · Updated 7 years ago
- Deep reinforcement learning with TensorFlow ☆47 · Updated 7 years ago
- FRED simulator and associated paper ☆26 · Updated 9 years ago
- This is my original repository of the decaf code. Decaf is a precursor of Caffe written in Python for deep image classification. It is de… ☆43 · Updated 11 years ago
- Coding example of DLIF tutorial ☆66 · Updated 8 years ago
- Sublinear memory optimization for deep learning: reduces GPU memory cost to train deeper nets ☆28 · Updated 8 years ago
- Benchmarks for several RNN variations with different deep-learning frameworks ☆169 · Updated 5 years ago
- Papers and blogs related to distributed deep learning ☆96 · Updated 7 years ago
- MXNet Model Serving ☆25 · Updated 7 years ago
- Flattened convolutional neural networks (1D convolution modules for Torch nn) ☆61 · Updated 9 years ago
- ☆15 · Updated 6 years ago
- Efficient layer normalization GPU kernel for TensorFlow ☆110 · Updated 7 years ago
- Example code that appears in lectures ☆23 · Updated 3 years ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines ☆125 · Updated 7 years ago
- MXNet Tutorial for NVIDIA GTC 2016. ☆130 · Updated 8 years ago
- (Spring 2018) Assignment 2: Graph Executor with TVM ☆124 · Updated 6 years ago
- Matrix Shadow: Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning ☆33 · Updated 8 years ago
- ☆29 · Updated 9 years ago
- The TensorFlow implementation of NIPS2016 paper "LightRNN: Memory and Computation-Efficient Recurrent Neural Networks" (https://arxiv.org… ☆56 · Updated 8 years ago
- Demos of interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM ☆49 · Updated 7 years ago
- TensorFlow implementation of SGD with Coupled Adaptive Batch Size (CABS) ☆43 · Updated 8 years ago
- Proof of concept prototype to perform distributed training using BVLC/caffe, based on a parameter server implementation using MPI. Data p… ☆13 · Updated 9 years ago
- ☆35 · Updated 7 years ago
- Cyclades ☆28 · Updated 6 years ago
- ☆58 · Updated 8 years ago
- ☆12 · Updated 6 years ago
- An implementation of Highway Networks in Caffe ☆95 · Updated 9 years ago
- C++ code for "A Faster Drop-in Implementation for Leaf-wise Exact Greedy Induction of Decision Tree Using Pre-sorted Deque" ☆36 · Updated last year
- MultiGPU enabled image generative models (GAN and DCGAN) ☆207 · Updated 4 years ago