sol-prog / Sort_data_parallel
☆12Updated 12 years ago
Alternatives and similar repositories for Sort_data_parallel
Users that are interested in Sort_data_parallel are comparing it to the libraries listed below
- (Spring 2017) Assignment 2: GPU Executor☆62Updated 8 years ago
- ☆15Updated 7 years ago
- Training a Tensorflow graph in C++☆25Updated 8 years ago
- LASSO is a parallel regression model learning system☆69Updated 11 years ago
- Flattened convolutional neural networks (1D convolution modules for Torch nn)☆61Updated 9 years ago
- Implementing (parts of) TensorFlow (almost) from Scratch☆30Updated 7 years ago
- Proof of concept prototype to perform distributed training using BVLC/caffe, based on a parameter server implementation using MPI. Data p…☆13Updated 10 years ago
- Convolutional neural networks C++ framework with CPU and GPU (CUDA) backends☆178Updated 6 years ago
- C++ library [machine learning & numerical optimization] - superseded by libnano☆1Updated 6 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆28Updated 9 years ago
- Matrix library for CUDA in C++ and Python☆195Updated 8 years ago
- Deep neural network framework (C/C++/CUDA).☆31Updated 9 years ago
- Fast binary matrix product on CPU☆10Updated 9 years ago
- a C++ wrapper of Caffe and mxnet to make predictions☆49Updated 7 years ago
- Long Short-Term Memory Recurrent Neural Networks☆26Updated 9 years ago
- Simple examples for extending Python with C/C++☆11Updated 8 years ago
- AI Final Project☆65Updated 9 years ago
- Asynchronous One Step Q Learning implemented with MXNET☆20Updated 8 years ago
- C++ 11 implementation of Geoff Hinton's Deep Learning matlab code☆284Updated 9 years ago
- GPU/CPU (CUDA) Implementation of "Recurrent Memory Array Structures", Simple RNN, LSTM, Array LSTM..☆25Updated 5 years ago
- TH++, C++ interface to the torch7 TH library☆238Updated 6 years ago
- MPI Parallel framework for training deep learning models built in Theano☆54Updated 7 years ago
- Common Code Workflow tutorial on Theano☆16Updated 9 years ago
- Benchmarking matrix multiplication implementations☆98Updated 8 years ago
- profiling gemm on android☆10Updated 9 years ago
- Custom fork containing our own python backend for integration into neon☆15Updated 2 years ago
- Tools to convert Caffe models to neon's serialization format☆39Updated 2 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 5 years ago
- A GPU / CPU implementation of a feed forward neural network☆31Updated 10 years ago
- Matrix Shadow: Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning☆33Updated 8 years ago