eliben / cs344

Introduction to Parallel Programming class code

☆30

Alternatives and similar repositories for cs344

Users that are interested in cs344 are comparing it to the libraries listed below

OneRaynyDay / autodiff
Symbolic differentiation engine for optimization-based machine learning models.
☆43 Updated 7 years ago
ppwwyyxx / haDNN
Proof-of-Concept CNN in Halide
☆22 Updated 9 years ago
milakov / nnForge
Convolutional neural networks C++ framework with CPU and GPU (CUDA) backends
☆181 Updated 6 years ago
xingdi-eric-yuan / cuda-deep-neural-nets
Deep neural network framework (C/C++/CUDA).
☆32 Updated 9 years ago
NervanaSystems / neon_course
neon tutorials
☆93 Updated 2 years ago
bjoern-andres / random-forest
Randomized Decision Trees: A Fast C++ Implementation of Random Forests.
☆179 Updated 4 years ago
tmulc18 / DistributedDeepLearningReads
Papers and blogs related to distributed deep learning
☆96 Updated 7 years ago
jdeng / rbm-mnist
C++ 11 implementation of Geoff Hinton's Deep Learning matlab code
☆285 Updated 9 years ago
attractivechaos / matmul
Benchmarking matrix multiplication implementations
☆100 Updated 8 years ago
naibaf7 / libdnn
Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL
☆136 Updated 8 years ago
mikeroberts3000 / GpuComputing
This repository contains easy-to-read Python/CUDA implementations of fundamental GPU computing primitives.
☆36 Updated 10 years ago
ibebrett / CUDA-CS344
My solutions to Udacity's Parallel Programming course (CS 344)
☆75 Updated 8 years ago
HFTrader / awesome-modern-cpp
A collection of resources on modern C++
☆47 Updated 2 years ago
hannes-brt / cudnn-python-wrappers
Python wrappers for the NVIDIA cuDNN libraries
☆140 Updated 8 years ago
theflofly / dnn_tensorflow_cpp
This project is a simple deep neural network trained using only TensorFlow C++.
☆117 Updated last year
ashwin / coursera-heterogeneous
Resources to work offline on the assignments of Heterogeneous Parallel Programming course from Coursera.
☆72 Updated 5 years ago
tatsy / educnn
Simple implementation of CNN (convolutional neural network) with precise-comments.
☆37 Updated 3 years ago
strin / gemm-android
tutorial to optimize GEMM performance on android
☆51 Updated 9 years ago
CNugteren / CLCudaAPI
A portable high-level API with CUDA or OpenCL back-end
☆54 Updated 7 years ago
OpenMathLib / OpenVML
Vector Math Library
☆79 Updated 3 weeks ago
peter-can-talk / meeting-cpp-2017
Slides and code for my talk at MeetingC++ 2017
☆48 Updated 7 years ago
deepsea-inria / pasl
Parallel Algorithm Scheduling Library
☆106 Updated 8 years ago
uchicago-cs / cmsc12300
CMSC 12300 - Computer Science with Applications 3
☆74 Updated 8 years ago
Shark-ML / Remora
High performance C++ Linear Algebra Library
☆15 Updated 4 years ago
facebookarchive / thpp
TH++, C++ interface to the torch7 TH library
☆238 Updated 7 years ago
Maratyszcza / FPplus
Scientific library for high-precision computations and research
☆49 Updated 7 years ago
masahi / nnvm-vision-demo
Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM
☆49 Updated 7 years ago
NVIDIA / cnmem
A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory
☆298 Updated 6 years ago
cjang / GATLAS
GPU Automatically Tuned Linear Algebra Software
☆28 Updated 9 years ago
intfloat / coursera
related materials for coursera & edx MOOCs, will no longer update.
☆63 Updated 9 years ago