dlsys-course / assignment2-2017
(Spring 2017) Assignment 2: GPU Executor
☆63Updated 7 years ago
Alternatives and similar repositories for assignment2-2017:
Users that are interested in assignment2-2017 are comparing it to the libraries listed below
- Just-in-time Dynamic Batching with MXNet Gluon.☆52Updated 4 years ago
- Deep reinforcement learning with TensorFlow☆47Updated 7 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆28Updated 8 years ago
- (Spring 2018) Assignment 2: Graph Executor with TVM☆124Updated 6 years ago
- Flattened convolutional neural networks (1D convolution modules for Torch nn)☆61Updated 9 years ago
- This is my original repository of the decaf code. Decaf is a precursor of Caffe written in Python for deep image classification. It is de…☆43Updated 11 years ago
- MXNet Tutorial for NVidia GTC 2016.☆131Updated 8 years ago
- Sequence to sequence learning with MXNET☆50Updated 8 years ago
- Example codes appears in lectures☆23Updated 3 years ago
- FRED simulator and associated paper☆26Updated 9 years ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines☆126Updated 7 years ago
- Papers and blogs related to distributed deep learning☆97Updated 7 years ago
- Training deep neural networks with low precision multiplications☆63Updated 9 years ago
- Asynchronous One Step Q Learning implemented with MXNET☆20Updated 8 years ago
- a mxnet multi-task tutorial☆33Updated 8 years ago
- Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning☆33Updated 8 years ago
- Coding example of DLIF tutorial☆66Updated 7 years ago
- MultiGPU enabled image generative models (GAN and DCGAN)☆208Updated 4 years ago
- Deep learning system course☆218Updated 6 years ago
- Benchmarks for several RNN variations with different deep-learning frameworks☆169Updated 5 years ago
- ☆16Updated 6 years ago
- Distributed LDA, takes raw text as input and outputs topic word table.☆16Updated 8 years ago
- Efficient layer normalization GPU kernel for Tensorflow☆110Updated 7 years ago
- C++ code for "A Faster Drop-in Implementation for Leaf-wise Exact Greedy Induction of Decision Tree Using Pre-sorted Deque"☆36Updated last year
- Implementing (parts of) TensorFlow (almost) from Scratch☆30Updated 7 years ago
- DNI(Decoupled Neural Interfaces using Synthetic Gradients) implementation with Torch☆29Updated 8 years ago
- The tensorflow implementation of NIPS2016 paper "LightRNN: Memory and Computation-Efficient Recurrent Neural Networks" (https://arxiv.org…☆56Updated 7 years ago
- MXNet implementation of Deep Q-learning☆34Updated 7 years ago
- The code to learn mxnet☆60Updated 8 years ago
- mpi-caffe☆49Updated 5 years ago