muatik / openmp-examples
openmp examples
☆141Updated 5 years ago
Alternatives and similar repositories for openmp-examples:
Users that are interested in openmp-examples are comparing it to the libraries listed below
- Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )☆59Updated last week
- Source Code for 'Pro TBB: C++ Parallel Programming with Threading Building Blocks' by Michael Voss, Rafael Asenjo, and James Reinders☆177Updated 2 weeks ago
- OpenMP tutorial☆37Updated 7 years ago
- Learn OpenMP examples step by step☆89Updated 3 weeks ago
- 大规模并行处理器编程实战 第二版答案☆30Updated 2 years ago
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆89Updated last year
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- The CMake version of cuda_by_example☆146Updated 4 years ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆83Updated 11 months ago
- Stepwise optimizations of DGEMM on CPU, reaching performance faster than Intel MKL eventually, even under multithreading.☆126Updated 3 years ago
- A library of various helper routines and frameworks used by many of the lab's software☆47Updated 9 months ago
- Implementation of breadth first search on GPU with CUDA Driver API.☆47Updated 3 years ago
- Future home of hpc-tutorials.llnl.gov☆232Updated 6 months ago
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A…☆259Updated last month
- pdf☆89Updated 6 years ago
- Exercises and Solutions for "Programming Your GPU with OpenMP: A Hands-On Introduction"☆130Updated 2 months ago
- Implementation and analysis of five different GPU based SPMV algorithms in CUDA☆38Updated 6 years ago
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆209Updated 2 months ago
- ☆261Updated 4 years ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019☆55Updated 2 years ago
- Parallel Tasking Library (PTL) - Lightweight C++11 mutilthreading tasking system featuring thread-pool, task-groups, and lock-free task q…☆43Updated 3 months ago
- Examples for using SYCL on CUDA☆60Updated 2 weeks ago
- ☆417Updated 9 years ago
- BLISlab: A Sandbox for Optimizing GEMM☆496Updated 3 years ago
- ☆93Updated 8 years ago
- AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/Ope…☆58Updated this week
- Learn OpenCL step by step.☆133Updated 2 years ago
- Little OpenMP Library☆157Updated 2 years ago
- Personal Notes for Learning HPC & Parallel Computation [Active Adding New Content]☆61Updated 2 years ago
- CUDA by practice☆121Updated 5 years ago