jslee02 / awesome-gpgpu
A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources
☆90 Updated 2 years ago
Alternatives and similar repositories for awesome-gpgpu:
Users who are interested in awesome-gpgpu are comparing it to the repositories listed below.
- Algorithms implemented in CUDA + resources about GPGPU ☆55 Updated 3 years ago
- CUDA Guide ☆64 Updated last year
- OpenCL Guide ☆18 Updated 3 years ago
- This is a list of useful libraries and resources for CUDA development. ☆561 Updated 7 years ago
- Tensor Tiling Library ☆36 Updated last week
- A collection of awesome algorithms, implemented in CUDA. ☆25 Updated 7 years ago
- Learn OpenCL step by step. ☆135 Updated 2 years ago
- Graphics Processing Unit (GPU) Architecture Guide ☆201 Updated 3 years ago
- A micro Vulkan compute pipeline and a collection of benchmarking compute shaders ☆238 Updated last month
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis. ☆50 Updated last month
- Implement Neural Networks in Cuda from Scratch ☆22 Updated 11 months ago
- Corrected source for the OpenCL in Action book (work in progress) ☆64 Updated 11 years ago
- Set of utilities supporting workflows common in GPU raytracing applications ☆111 Updated last week
- AMD's graph optimization engine. ☆215 Updated this week
- Implementation of a few sorting algorithms in OpenCL ☆35 Updated 5 years ago
- Sample benchmark demonstrating the VK_KHR_cooperative_matrix extension ☆88 Updated last month
- IREE's PyTorch Frontend, based on Torch Dynamo. ☆79 Updated this week
- A profiler to disclose and quantify hardware features on GPUs. ☆168 Updated 2 years ago
- ☆45 Updated this week
- CUDA kernel author's tools ☆111 Updated 3 years ago
- ☆62 Updated 2 months ago
- rocWMMA ☆109 Updated this week
- LLM training in simple, raw C/CUDA ☆92 Updated 11 months ago
- Vulkan Guide ☆28 Updated 3 years ago
- Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal - all it takes to sum a lot of numbers fast! ☆96 Updated 2 months ago
- μ-Cuda, COVER THE LAST MILE OF CUDA. With features: intellisense-friendly, structured launch, automatic cuda graph generation and updatin… ☆173 Updated 3 weeks ago
- Training material for Nsight developer tools ☆156 Updated 8 months ago
- GPUOcelot: A dynamic compilation framework for PTX ☆187 Updated 2 months ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆261 Updated 3 months ago
- Simple example of using Vulkan for GPGPU computing ☆53 Updated 6 years ago