nickspell / udacity-IntroToParallelProgramming
CS344 - Introduction To Parallel Programming course (Udacity) proposed solutions
☆52Updated 7 years ago
Alternatives and similar repositories for udacity-IntroToParallelProgramming:
Users that are interested in udacity-IntroToParallelProgramming are comparing it to the libraries listed below
- Google Colab Notebooks for Udacity CS344 - Intro to Parallel Programming☆133Updated 3 years ago
- Windows Visual Studio Solutions for class "Introduction to Parallel Programming"☆19Updated 6 years ago
- CUDA by practice☆121Updated 5 years ago
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆130Updated 4 years ago
- Training material for Nsight developer tools☆143Updated 5 months ago
- CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. …☆375Updated last year
- cuDNN sample codes provided by Nvidia☆45Updated 5 years ago
- Implementation of breadth first search on GPU with CUDA Driver API.☆47Updated 3 years ago
- Fast CUDA Kernels for ResNet Inference.☆169Updated 5 years ago
- Introduction to CUDA programming☆115Updated 7 years ago
- Algorithms implemented in CUDA + resources about GPGPU☆53Updated 3 years ago
- A way to use cuda to accelerate top k algorithm☆29Updated 7 years ago
- Implementation of a simple CNN using CUDA☆66Updated 7 years ago
- ☆65Updated 10 years ago
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆88Updated last year
- Some CUDA design patterns and a bit of template magic for CUDA☆148Updated last year
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆83Updated 11 months ago
- Learn OpenCL step by step.☆131Updated 2 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆196Updated 2 years ago
- "Hardware, Software, and Compilers! Oh My!" tutorial files☆17Updated 5 years ago
- implementation of winograd minimal convolution algorithm on Intel Architecture☆39Updated 7 years ago
- Matrix Multiply-Accumulate with CUDA and WMMA( Tensor Core)☆122Updated 4 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆104Updated 7 years ago
- Personal collection of references for high performance mixed precision training.☆41Updated 5 years ago
- ☆406Updated 9 years ago
- Template for starting CUDA/C++ project using CMake with Github Action for CI☆29Updated 2 years ago
- IMPACT GPU Algorithms Teaching Labs☆56Updated last year
- CUDA for MNIST training/inference☆37Updated last year
- CUDA Matrix Multiplication Optimization☆155Updated 6 months ago
- ☆108Updated 9 months ago