udacity / cs344
Introduction to Parallel Programming class code
☆1,315Updated 2 years ago
Alternatives and similar repositories for cs344:
Users that are interested in cs344 are comparing it to the libraries listed below
- Source code that accompanies The CUDA Handbook.☆521Updated last month
- Source code examples from the Parallel Forall Blog☆1,270Updated 8 months ago
- This is an archive of materials produced for an introductory class on CUDA programming at Stanford University in 2010☆214Updated 2 years ago
- ☆427Updated 9 years ago
- BLISlab: A Sandbox for Optimizing GEMM☆507Updated 3 years ago
- This is a list of useful libraries and resources for CUDA development.☆553Updated 7 years ago
- Automatically exported from code.google.com/p/cuda-convnet2☆793Updated 9 years ago
- My solutions to Udacity's Parallel Programming course (CS 344)☆75Updated 7 years ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,735Updated last year
- Learn CUDA Programming, published by Packt☆1,120Updated last year
- CUDA official sample codes☆365Updated 9 years ago
- Assembler for NVIDIA Maxwell architecture☆981Updated 2 years ago
- Patterns and behaviors for GPU computing☆1,707Updated 2 years ago
- A CUDNN minimal deep learning training code sample using LeNet.☆264Updated last year
- Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)☆721Updated 7 months ago
- Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch☆814Updated last year
- Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning☆1,111Updated 5 years ago
- CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. …☆401Updated last year
- Google Colab Notebooks for Udacity CS344 - Intro to Parallel Programming☆133Updated 3 years ago
- CUDA Data Parallel Primitives Library☆428Updated 6 years ago
- ☆1,841Updated last year
- Dive into Deep Learning Compiler☆647Updated 2 years ago
- Acceleration package for neural networks on multi-core CPUs☆1,685Updated 9 months ago
- CNN accelerated by cuda. Test on mnist and finilly get 99.76%☆187Updated 7 years ago
- CS344 - Introduction To Parallel Programming course (Udacity) proposed solutions☆54Updated 7 years ago
- Easy benchmarking of all publicly accessible implementations of convnets☆2,683Updated 7 years ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆663Updated last month
- Source code repository for the projects from CUDA for Engineers☆129Updated 3 years ago
- ATen: A TENsor library for C++11☆694Updated 5 years ago
- C++ extensions in PyTorch☆1,073Updated 2 months ago