jslee02 / awesome-gpgpu
A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources
☆88Updated 2 years ago
Alternatives and similar repositories for awesome-gpgpu:
Users that are interested in awesome-gpgpu are comparing it to the libraries listed below
- Algorithms implemented in CUDA + resources about GPGPU☆55Updated 3 years ago
- Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal - all it takes to sum a lot of numbers fast!☆90Updated 2 weeks ago
- CUDA Guide☆62Updated last year
- Tensor Tiling Library☆34Updated last week
- ☆61Updated last month
- This is a list of useful libraries and resources for CUDA development.☆552Updated 7 years ago
- A micro Vulkan compute pipeline and a collection of benchmarking compute shaders☆233Updated 7 months ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆50Updated last year
- Graphics Processing Unit (GPU) Architecture Guide☆189Updated 3 years ago
- Collection of easy, well-documented and useful OpenCL examples in C++.☆73Updated 3 years ago
- Implement Neural Networks in Cuda from Scratch☆22Updated 9 months ago
- GPUOcelot: A dynamic compilation framework for PTX☆178Updated last month
- Learn OpenCL step by step.☆133Updated 2 years ago
- A collection of awesome algorithms, implemented in CUDA.☆24Updated 7 years ago
- Source Code for 'Pro TBB: C++ Parallel Programming with Threading Building Blocks' by Michael Voss, Rafael Asenjo, and James Reinders☆177Updated last month
- Simple example of using Vulkan for GPGPU computing☆53Updated 6 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆111Updated 9 months ago
- Intel(R) Open Volume Kernel Library☆205Updated 3 months ago
- AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/Ope…☆59Updated this week
- ☆43Updated this week
- Set of utilities supporting workflows common in GPU raytracing applications☆107Updated last month
- CUDA kernel author's tools☆110Updated 2 years ago
- OpenCL Guide☆17Updated 3 years ago
- A profiler to disclose and quantify hardware features on GPUs.☆167Updated 2 years ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆263Updated 2 months ago
- ☆218Updated 3 weeks ago
- Sample benchmark demonstrating the VK_KHR_cooperative_matrix extension☆86Updated 2 months ago
- ☆41Updated 5 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆149Updated last year
- μ-Cuda, COVER THE LAST MILE OF CUDA. With features: intellisense-friendly, structured launch, automatic cuda graph generation and updatin…☆172Updated 2 weeks ago