sol-prog / Sort_data_parallelLinks
☆12Updated 12 years ago
Alternatives and similar repositories for Sort_data_parallel
Users that are interested in Sort_data_parallel are comparing it to the libraries listed below
Sorting:
- TH++, C++ interface to the torch7 TH library☆246Updated 7 years ago
- Nervana GPU library☆49Updated 10 years ago
- LASSO is a parallel regression model learning system☆69Updated 12 years ago
- C++ 11 implementation of Geoff Hinton's Deep Learning matlab code☆286Updated 10 years ago
- Convolutional neural networks C++ framework with CPU and GPU (CUDA) backends☆182Updated 7 years ago
- Facebook's CUDA extensions.☆284Updated 6 years ago
- (Spring 2017) Assignment 2: GPU Executor☆63Updated 8 years ago
- ☆81Updated 7 years ago
- An open source library for artificial neural networks.☆122Updated 5 years ago
- A CUDA implementation of the k-means clustering algorithm☆255Updated 13 years ago
- Computation using data flow graphs for scalable machine learning☆35Updated 8 years ago
- Custom fork containing our own python backend for integration into neon☆15Updated 3 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆300Updated 7 years ago
- A GPU implementation of Convolutional Neural Nets in C++☆505Updated 5 years ago
- My fork of Alex Krizhevsky's cuda-convnet from 2013 where I added dropout, among other features.☆259Updated 11 years ago
- C++ interface for mxnet☆115Updated 8 years ago
- Implements a message passing interface (MPI) wrapper that makes it easy to do massively parallel computations inside the Torch deep-learn…☆110Updated 6 years ago
- Matrix library for CUDA in C++ and Python☆196Updated 9 years ago
- Randomized Decision Trees: A Fast C++ Implementation of Random Forests.☆179Updated 5 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆28Updated 9 years ago
- Ian Goodfellow, Yoshua Bengio and Aaron Courville's deep learning book Chinese translation☆55Updated 4 years ago
- GPU-accelerated LIBSVM is a modification of the original LIBSVM that exploits the CUDA framework to significantly reduce processing time …☆219Updated 8 years ago
- Proof of concept prototype to perform distributed training using BVLC/caffe, based on a parameter server implementation using MPI. Data p…☆13Updated 10 years ago
- ☆154Updated 9 years ago
- PMLS-Caffe: Distributed Deep Learning Framework for Parallel ML System☆193Updated 7 years ago
- clang with OpenMP 3.1 and some elements of OpenMP 4.0 support☆91Updated 10 years ago
- ☆15Updated 7 years ago
- A C++ implementaton of MapReduce without distributed filesystem☆267Updated 9 years ago
- Introduction to Parallel Programming class code☆30Updated 10 years ago
- Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning☆33Updated 9 years ago