dmlc / mshadowLinks
Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning
☆1,119Updated 6 years ago
Alternatives and similar repositories for mshadow
Users that are interested in mshadow are comparing it to the libraries listed below
Sorting:
- A common bricks library for building scalable and portable distributed machine learning.☆882Updated this week
- ☆1,655Updated 7 years ago
- move forward to https://github.com/dmlc/mxnet☆1,026Updated 10 years ago
- Minerva: a fast and flexible tool for deep learning on multi-GPU. It provides ndarray programming interface, just like Numpy. Python bind…☆705Updated 7 years ago
- Reliable Allreduce and Broadcast Interface for distributed machine learning☆512Updated 5 years ago
- Fast Recurrent Networks Library☆578Updated 9 years ago
- Acceleration package for neural networks on multi-core CPUs☆1,702Updated last year
- moved to https://github.com/dmlc/ps-lite☆648Updated 10 years ago
- ATen: A TENsor library for C++11☆710Updated 6 years ago
- Assignment 1: automatic differentiation☆475Updated 6 years ago
- Parameter server framework for distributed machine learning☆779Updated 6 years ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines☆2,016Updated 7 years ago
- Automatically exported from code.google.com/p/cuda-convnet2☆816Updated 10 years ago
- ☆370Updated 8 years ago
- PMLS-Caffe: Distributed Deep Learning Framework for Parallel ML System☆193Updated 7 years ago
- Deprecated☆337Updated 8 years ago
- A GPU implementation of Convolutional Neural Nets in C++☆505Updated 5 years ago
- Open single and half precision gemm implementations☆393Updated 2 years ago
- ☆600Updated 7 years ago
- A lightweight parameter server interface☆1,558Updated 2 years ago
- LSTM implementation on Caffe☆492Updated 9 years ago
- Purified Purine.☆256Updated 10 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆299Updated 7 years ago
- A CUDNN minimal deep learning training code sample using LeNet.☆268Updated 2 years ago
- My fork of Alex Krizhevsky's cuda-convnet from 2013 where I added dropout, among other features.☆258Updated 10 years ago
- Benchmarking Deep Learning operations on different hardware☆1,104Updated 4 years ago
- Low-precision matrix multiplication☆1,821Updated last year
- Parallel ML System - Bosen project☆961Updated last year
- Deep learning system course☆214Updated 6 years ago
- This is a Experimental version of OpenCL by AMD Research, we now recommend you to use The official BVLC Caffe OpenCL branch is over at …☆529Updated 7 years ago