rbaygildin / learn-gpgpu
Algorithms implemented in CUDA + resources about GPGPU
☆55Updated 3 years ago
Alternatives and similar repositories for learn-gpgpu:
Users that are interested in learn-gpgpu are comparing it to the libraries listed below
- A collection of awesome algorithms, implemented in CUDA.☆25Updated 7 years ago
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆89Updated last year
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆50Updated last week
- Concurrent CPU-GPU Programming using Task Models☆101Updated 5 years ago
- ☆42Updated 7 years ago
- ☆66Updated 11 years ago
- Examples for using SYCL on CUDA☆62Updated 3 weeks ago
- Learn OpenCL step by step.☆134Updated 2 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆150Updated last year
- Learn OpenMP examples step by step☆91Updated 2 months ago
- portDNN is a library implementing neural network algorithms written using SYCL☆111Updated 10 months ago
- ☆23Updated 3 years ago
- A Collection of Articles and other OpenCL Papers☆56Updated 6 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆28Updated 4 years ago
- OpenCL Tutorials☆52Updated 5 years ago
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆117Updated 2 months ago
- CNNs in Halide☆23Updated 9 years ago
- BLAS implementation for Intel FPGA☆78Updated 4 years ago
- A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources☆89Updated 2 years ago
- A domain-specific language and compiler for image processing☆76Updated 4 years ago
- BGHT: High-performance static GPU hash tables.☆62Updated 6 months ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- C++ convenience classes to be used with CUDA code, for both the host and the kerlel parts.☆55Updated 6 years ago
- ☆38Updated 3 years ago
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆59Updated 2 weeks ago
- This is a tuned sparse matrix dense vector multiplication(SpMV) library☆21Updated 9 years ago
- Sparse-dense matrix-matrix multiplication on GPUs☆14Updated 6 years ago
- My notes on various HPC papers.☆22Updated 2 years ago
- SYCL for Vitis: Experimental fusion of triSYCL with Intel SYCL oneAPI DPC++ up-streaming effort into Clang/LLVM☆116Updated 4 months ago
- ☆11Updated 4 years ago