nickspell / udacity-IntroToParallelProgramming
CS344 - Introduction To Parallel Programming course (Udacity) proposed solutions
☆53 · Updated 7 years ago
Alternatives and similar repositories for udacity-IntroToParallelProgramming:
Users who are interested in udacity-IntroToParallelProgramming are comparing it to the repositories listed below
- CUDA by practice ☆125 · Updated 5 years ago
- Windows Visual Studio Solutions for class "Introduction to Parallel Programming" ☆19 · Updated 6 years ago
- Example of how to use CUDA with CMake >= 3.8 ☆69 · Updated last year
- Google Colab Notebooks for Udacity CS344 - Intro to Parallel Programming ☆133 · Updated 4 years ago
- cuDNN sample codes provided by Nvidia ☆45 · Updated 6 years ago
- Implementation of breadth first search on GPU with CUDA Driver API. ☆49 · Updated 4 years ago
- Fast CUDA Kernels for ResNet Inference. ☆173 · Updated 5 years ago
- ☆44 · Updated 7 years ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE. ☆84 · Updated last year
- Training material for Nsight developer tools ☆156 · Updated 8 months ago
- CNNs in Halide ☆23 · Updated 9 years ago
- ☆59 · Updated 2 years ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels. ☆70 · Updated 9 years ago
- Some CUDA design patterns and a bit of template magic for CUDA ☆150 · Updated last year
- BGHT: High-performance static GPU hash tables. ☆63 · Updated 2 weeks ago
- ☆436 · Updated 9 years ago
- ☆22 · Updated 5 years ago
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems ☆130 · Updated 4 years ago
- portDNN is a library implementing neural network algorithms written using SYCL ☆113 · Updated 11 months ago
- Introduction to CUDA programming ☆116 · Updated 7 years ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU ☆81 · Updated 5 years ago
- Source code examples from the Parallel Forall Blog ☆96 · Updated 6 years ago
- flexible-gemm conv of deepcore ☆17 · Updated 5 years ago
- Optimized half precision gemm assembly kernels (deprecated due to ROCm) ☆47 · Updated 7 years ago
- CUDA Matrix Multiplication Optimization ☆181 · Updated 9 months ago
- Algorithms implemented in CUDA + resources about GPGPU ☆55 · Updated 3 years ago
- Learn OpenCL step by step. ☆135 · Updated 2 years ago
- CUDA for MNIST training/inference ☆40 · Updated last year
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth ☆104 · Updated 7 years ago
- Subpart source code of deepcore v0.7 ☆27 · Updated 4 years ago