eliben / cs344
Introduction to Parallel Programming class code
☆31Updated 9 years ago
Related projects ⓘ
Alternatives and complementary repositories for cs344
- Symbolic differentiation engine for optimization-based machine learning models.☆42Updated 7 years ago
- Resources to work offline on the assignments of Heterogenous Parallel Programming course from Coursera.☆71Updated 5 years ago
- Proof-of-Concept CNN in Halide☆21Updated 8 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- My solutions to Udacity's Parallel Programming course (CS 344)☆75Updated 7 years ago
- Fork of magma to include more BLAS☆28Updated 7 years ago
- Windows Visual Studio Solutions for class "Introduction to Parallel Programming"☆19Updated 6 years ago
- Simple examples for extending Python with C/C++☆11Updated 8 years ago
- CNNs in Halide☆23Updated 9 years ago
- CMSC 12300 - Computer Science with Applications 3☆75Updated 7 years ago
- Simple example of implementing a new Tensorflow operation and its gradient in C++.☆56Updated 5 years ago
- Python wrappers for the NVIDIA cuDNN libraries☆140Updated 7 years ago
- BLAS OpenCL implementation.☆15Updated 9 years ago
- A Light-weight and Fast Template Matrix Library☆131Updated 11 years ago
- This repository contains easy-to-read Python/CUDA implementations of fundamental GPU computing primitives.☆36Updated 9 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- Convolution op for Theano based on CuFFT using scikits.cuda☆51Updated 10 years ago
- Sample implementation of a proposed C++ hashing framework☆29Updated 9 years ago
- A CUDNN minimal deep learning training code sample using LeNet.☆263Updated last year
- Parallel Algorithm Scheduling Library☆103Updated 7 years ago
- Full-speed Array of Structures access☆161Updated last year
- Vector Math Library☆75Updated 7 years ago
- ☆75Updated last year
- GPU Automatically Tuned Linear Algebra Software☆28Updated 9 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆63Updated 5 years ago
- Logistic regression engine for medium-sized data☆55Updated 9 years ago
- Communication-Minimizing 2D Convolution in GPU Registers☆30Updated 11 years ago
- Parallel network flows using OpenMP and CUDA.☆27Updated 6 years ago
- profiling gemm on android☆10Updated 8 years ago