nickspell / udacity-IntroToParallelProgramming
CS344 - Introduction To Parallel Programming course (Udacity) proposed solutions
☆54Updated 7 years ago
Alternatives and similar repositories for udacity-IntroToParallelProgramming:
Users that are interested in udacity-IntroToParallelProgramming are comparing it to the libraries listed below
- Windows Visual Studio Solutions for class "Introduction to Parallel Programming"☆19Updated 6 years ago
- Google Colab Notebooks for Udacity CS344 - Intro to Parallel Programming☆134Updated 3 years ago
- Implementation of breadth first search on GPU with CUDA Driver API.☆48Updated 3 years ago
- Fast CUDA Kernels for ResNet Inference.☆173Updated 5 years ago
- CUDA by practice☆125Updated 5 years ago
- ☆66Updated 11 years ago
- cuDNN sample codes provided by Nvidia☆45Updated 6 years ago
- BGHT: High-performance static GPU hash tables.☆62Updated 6 months ago
- Dissecting NVIDIA GPU Architecture☆90Updated 2 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆197Updated 2 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆149Updated last year
- Introduction to CUDA programming☆115Updated 7 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆105Updated 7 years ago
- ☆22Updated 5 years ago
- ☆109Updated 11 months ago
- ☆42Updated 7 years ago
- Example of how to use CUDA with CMake >= 3.8☆69Updated last year
- implementation of winograd minimal convolution algorithm on Intel Architecture☆39Updated 7 years ago
- how to design cpu gemm on x86 with avx256, that can beat openblas.☆68Updated 5 years ago
- CUDA Matrix Multiplication Optimization☆173Updated 8 months ago
- BLISlab: A Sandbox for Optimizing GEMM☆509Updated 3 years ago
- CNNs in Halide☆23Updated 9 years ago
- Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.☆330Updated 2 months ago
- Training material for Nsight developer tools☆151Updated 7 months ago
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆130Updated 4 years ago
- CUDA 6大并行计算模式 代码与笔记☆60Updated 4 years ago
- ☆431Updated 9 years ago
- A library of GPU kernels for sparse matrix operations.☆260Updated 4 years ago
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆89Updated last year
- Learn OpenCL step by step.☆134Updated 2 years ago