eliben / cs344
Introduction to Parallel Programming class code
☆30 · Updated 10 years ago
Alternatives and similar repositories for cs344
Users who are interested in cs344 are comparing it to the repositories listed below:
- This repository contains easy-to-read Python/CUDA implementations of fundamental GPU computing primitives. ☆36 · Updated 10 years ago
- Symbolic differentiation engine for optimization-based machine learning models. ☆43 · Updated 8 years ago
- Windows Visual Studio Solutions for class "Introduction to Parallel Programming" ☆19 · Updated 7 years ago
- C++11 implementation of Geoff Hinton's Deep Learning MATLAB code ☆286 · Updated 10 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision ☆64 · Updated 6 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL ☆137 · Updated 8 years ago
- Resources to work offline on the assignments of the Heterogeneous Parallel Programming course from Coursera. ☆72 · Updated 6 years ago
- This project is a simple deep neural network trained using only TensorFlow C++. ☆117 · Updated 2 years ago
- Proof-of-Concept CNN in Halide ☆22 · Updated 9 years ago
- A cuDNN minimal deep learning training code sample using LeNet. ☆269 · Updated 2 years ago
- A lightweight and user-friendly C++ library for deep and convolutional neural networks with GPU acceleration ☆347 · Updated 9 years ago
- My solutions to Udacity's Parallel Programming course (CS 344) ☆75 · Updated 8 years ago
- Convolutional neural networks C++ framework with CPU and GPU (CUDA) backends ☆182 · Updated 6 years ago
- Tutorial on optimizing GEMM performance on Android ☆51 · Updated 9 years ago
- Fork of MAGMA to include more BLAS ☆28 · Updated 9 years ago
- neon tutorials ☆93 · Updated 2 years ago
- Parallel Algorithm Scheduling Library ☆107 · Updated 8 years ago
- Deep neural network framework (C/C++/CUDA). ☆32 · Updated 10 years ago
- Python wrappers for the NVIDIA cuDNN libraries ☆142 · Updated 8 years ago
- Chinese translation of Ian Goodfellow, Yoshua Bengio and Aaron Courville's deep learning book ☆55 · Updated 4 years ago
- CNNs in Halide ☆23 · Updated 10 years ago
- My fork of Alex Krizhevsky's cuda-convnet from 2013 where I added dropout, among other features. ☆258 · Updated 10 years ago
- Papers and blogs related to distributed deep learning ☆96 · Updated 8 years ago
- Randomized Decision Trees: A Fast C++ Implementation of Random Forests. ☆179 · Updated 5 years ago
- A CUDA implementation of the PageRank Pipeline Benchmark ☆33 · Updated 8 years ago
- A portable high-level API with CUDA or OpenCL back-end ☆55 · Updated 8 years ago
- A collection of resources on modern C++ ☆47 · Updated 2 years ago
- Slides and code for my talk at MeetingC++ 2017 ☆48 · Updated 8 years ago
- Example code that appears in lectures ☆23 · Updated 3 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory ☆299 · Updated 6 years ago