dmlc / mshadow
Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning
☆1,109Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for mshadow
- A common bricks library for building scalable and portable distributed machine learning.☆865Updated 4 months ago
- ☆1,655Updated 6 years ago
- move forward to https://github.com/dmlc/mxnet☆1,026Updated 9 years ago
- Minerva: a fast and flexible tool for deep learning on multi-GPU. It provides ndarray programming interface, just like Numpy. Python bind…☆698Updated 6 years ago
- moved to https://github.com/dmlc/ps-lite☆649Updated 9 years ago
- Reliable Allreduce and Broadcast Interface for distributed machine learning☆506Updated 4 years ago
- Fast Recurrent Networks Library☆574Updated 8 years ago
- Acceleration package for neural networks on multi-core CPUs☆1,674Updated 4 months ago
- A lightweight parameter server interface☆1,539Updated last year
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆290Updated 5 years ago
- LSTM implementation on Caffe☆493Updated 8 years ago
- PMLS-Caffe: Distributed Deep Learning Framework for Parallel ML System☆194Updated 6 years ago
- Deprecated☆338Updated 7 years ago
- ATen: A TENsor library for C++11☆683Updated 4 years ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines☆2,001Updated 6 years ago
- Parameter server framework for distributed machine learning☆774Updated 5 years ago
- Some handy utility libraries and tools for the Caffe deep learning framework.☆457Updated 5 years ago
- Parallel ML System - Bosen project☆962Updated 9 months ago
- A GPU implementation of Convolutional Neural Nets in C++☆506Updated 4 years ago
- Facebook's CUDA extensions.☆282Updated 5 years ago
- My fork of Alex Krizhevsky's cuda-convnet from 2013 where I added dropout, among other features.☆254Updated 9 years ago
- Marvin: A Minimalist GPU-only N-Dimensional ConvNets Framework☆421Updated 6 years ago
- Low-precision matrix multiplication☆1,780Updated 9 months ago
- THE Deep Learning Benchmarks☆352Updated 8 years ago
- Purified Purine.☆256Updated 9 years ago
- Easy benchmarking of all publicly accessible implementations of convnets☆2,682Updated 7 years ago
- Deep learning system course☆218Updated 5 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆308Updated 7 years ago