eliben / cs344
Introduction to Parallel Programming class code
☆31Updated 9 years ago
Related projects: ⓘ
- Symbolic differentiation engine for optimization-based machine learning models.☆42Updated 6 years ago
- ☆62Updated this week
- Resources to work offline on the assignments of Heterogenous Parallel Programming course from Coursera.☆70Updated 4 years ago
- A portable high-level API with CUDA or OpenCL back-end☆53Updated 6 years ago
- Parallel Algorithm Scheduling Library☆101Updated 7 years ago
- Communication-Minimizing 2D Convolution in GPU Registers☆30Updated 10 years ago
- Fork of magma to include more BLAS☆28Updated 7 years ago
- A collection of resources on modern C++☆48Updated last year
- The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Uni…☆27Updated 13 years ago
- Proof-of-Concept CNN in Halide☆21Updated 8 years ago
- Simple example of implementing a new Tensorflow operation and its gradient in C++.☆56Updated 5 years ago
- My solutions to Udacity's Parallel Programming course (CS 344)☆76Updated 7 years ago
- Simple and Cutting-edge Deep Learning Library accelerated with GPU using C++ AMP☆19Updated 8 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- CMSC 12300 - Computer Science with Applications 3☆75Updated 7 years ago
- Machine Learning Benchmark Scripts☆100Updated 4 years ago
- C++11 std::async☆58Updated 11 years ago
- BLAS OpenCL implementation.☆15Updated 9 years ago
- Windows Visual Studio Solutions for class "Introduction to Parallel Programming"☆20Updated 5 years ago
- Launching collective tasks in bulk☆36Updated 4 years ago
- Example code used in the CVPR 2015 tutorial☆38Updated 8 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆63Updated 5 years ago
- CMake Examples (CMake, CMake+CUDA, CMake+CUDA+PandaRoot)☆41Updated 11 years ago
- GPU Automatically Tuned Linear Algebra Software☆28Updated 9 years ago
- Simple C++ reader for MNIST dataset☆83Updated 5 years ago
- C++ 11 implementation of Geoff Hinton's Deep Learning matlab code☆284Updated 9 years ago
- Deep neural network framework (C/C++/CUDA).☆31Updated 9 years ago
- Corrected source for the OpenCL in Action book (work in progress)☆60Updated 11 years ago
- Parallel network flows using OpenMP and CUDA.☆27Updated 5 years ago
- C++ convenience classes to be used with CUDA code, for both the host and the kerlel parts.☆55Updated 5 years ago