dmlc / mshadow
Matrix Shadow: Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning
☆1,117 · Updated 6 years ago
Alternatives and similar repositories for mshadow
Users who are interested in mshadow compare it to the libraries listed below.
- A common bricks library for building scalable and portable distributed machine learning. ☆894 · Updated last week
- ☆1,655 · Updated 7 years ago
- move forward to https://github.com/dmlc/mxnet ☆1,026 · Updated 10 years ago
- moved to https://github.com/dmlc/ps-lite ☆647 · Updated 10 years ago
- Minerva: a fast and flexible tool for deep learning on multi-GPU. It provides ndarray programming interface, just like Numpy. Python bind… ☆705 · Updated 6 years ago
- Reliable Allreduce and Broadcast Interface for distributed machine learning ☆514 · Updated 4 years ago
- Acceleration package for neural networks on multi-core CPUs ☆1,699 · Updated last year
- Fast Recurrent Networks Library ☆577 · Updated 9 years ago
- Parameter server framework for distributed machine learning ☆778 · Updated 6 years ago
- ATen: A TENsor library for C++11 ☆704 · Updated 5 years ago
- ☆371 · Updated 7 years ago
- PMLS-Caffe: Distributed Deep Learning Framework for Parallel ML System ☆193 · Updated 7 years ago
- Automatically exported from code.google.com/p/cuda-convnet2 ☆812 · Updated 9 years ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines ☆2,018 · Updated 7 years ago
- A GPU implementation of Convolutional Neural Nets in C++ ☆504 · Updated 5 years ago
- A lightweight parameter server interface ☆1,560 · Updated 2 years ago
- Purified Purine. ☆256 · Updated 10 years ago
- Fast Deep Learning Library (DLL) for C++ (ANNs, CNNs, RBMs, DBNs...) ☆685 · Updated 4 months ago
- My fork of Alex Krizhevsky's cuda-convnet from 2013 where I added dropout, among other features. ☆259 · Updated 10 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory ☆298 · Updated 6 years ago
- Parallel ML System - Bosen project ☆963 · Updated last year
- ☆599 · Updated 7 years ago
- Low-precision matrix multiplication ☆1,817 · Updated last year
- Deprecated ☆337 · Updated 8 years ago
- Open single and half precision gemm implementations ☆392 · Updated 2 years ago
- Benchmarking Deep Learning operations on different hardware ☆1,096 · Updated 4 years ago
- GPU-accelerated LIBSVM is a modification of the original LIBSVM that exploits the CUDA framework to significantly reduce processing time … ☆218 · Updated 8 years ago
- Some handy utility libraries and tools for the Caffe deep learning framework. ☆459 · Updated 6 years ago
- LSTM implementation on Caffe