eliben / cs344
Introduction to Parallel Programming class code
☆30 Updated 10 years ago
Alternatives and similar repositories for cs344:
Users who are interested in cs344 are comparing it to the repositories listed below.
- Symbolic differentiation engine for optimization-based machine learning models. ☆42 Updated 7 years ago
- A portable high-level API with CUDA or OpenCL back-end ☆54 Updated 7 years ago
- The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Uni… ☆27 Updated 13 years ago
- Deep neural network framework (C/C++/CUDA). ☆31 Updated 9 years ago
- TH++, C++ interface to the torch7 TH library ☆238 Updated 6 years ago
- Sample implementation of a proposed C++ hashing framework ☆29 Updated 9 years ago
- My solutions to Udacity's Parallel Programming course (CS 344) ☆75 Updated 7 years ago
- Simple example of implementing a new Tensorflow operation and its gradient in C++. ☆56 Updated 6 years ago
- Related materials for Coursera & edX MOOCs; will no longer be updated. ☆63 Updated 9 years ago
- C++ 11 implementation of Geoff Hinton's Deep Learning matlab code ☆284 Updated 9 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision ☆64 Updated 5 years ago
- Parallel Algorithm Scheduling Library ☆106 Updated 7 years ago
- CMake Examples (CMake, CMake+CUDA, CMake+CUDA+PandaRoot) ☆41 Updated 11 years ago
- GPU Automatically Tuned Linear Algebra Software ☆28 Updated 9 years ago
- (Spring 2017) Assignment 2: GPU Executor ☆62 Updated 7 years ago
- Scientific library for high-precision computations and research ☆49 Updated 7 years ago
- Fork of magma to include more BLAS ☆28 Updated 8 years ago
- Python wrappers for the NVIDIA cuDNN libraries ☆140 Updated 7 years ago
- Logistic regression engine for medium-sized data ☆55 Updated 9 years ago
- Papers and blogs related to distributed deep learning ☆96 Updated 7 years ago
- Communication-Minimizing 2D Convolution in GPU Registers ☆30 Updated 11 years ago
- This project is a simple deep neural network trained using only TensorFlow C++. ☆118 Updated last year
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL ☆135 Updated 8 years ago
- CMake module collection ☆30 Updated 10 years ago
- C++ Concurrency in Action - Practical Multithreading ☆59 Updated 8 years ago
- Matrix library for CUDA in C++ and Python ☆195 Updated 8 years ago
- ☆154 Updated 8 years ago
- Direct C++ Interface to PyTorch ☆80 Updated 6 years ago
- C++11 std::async ☆58 Updated 12 years ago
- This repository contains easy-to-read Python/CUDA implementations of fundamental GPU computing primitives. ☆36 Updated 9 years ago