ilyak / openmp-tutorial
OpenMP tutorial
☆36 Updated 7 years ago
Related projects
Alternatives and complementary repositories for openmp-tutorial
- An expression template based linear algebra library running completely on the GPU using CUDA ☆22 Updated 3 years ago
- ☆42 Updated 6 years ago
- Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc.) ☆57 Updated last week
- ☆22 Updated 5 years ago
- Numerical optimization in C++ ☆38 Updated 10 years ago
- Parallel Tasking Library (PTL) - Lightweight C++11 multithreading tasking system featuring thread-pool, task-groups, and lock-free task q… ☆43 Updated last week
- Some CUDA programming examples ☆25 Updated 7 years ago
- C++ implementation of sparse matrix using CRS (Compressed Row Storage) format ☆111 Updated 4 years ago
- High-Performance Computing: CPU Instructions, GPU OpenCL & CUDA, etc. ☆14 Updated 6 months ago
- Utilities for CUDA programming ☆39 Updated 5 years ago
- Some CUDA design patterns and a bit of template magic for CUDA ☆146 Updated last year
- The CMake version of cuda_by_example ☆145 Updated 4 years ago
- A cross-platform CUDA/C++17 starter project with google test and google benchmark support. ☆37 Updated last year
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming" ☆82 Updated last year
- Matrix Multiplication on GPU using Shared Memory considering Coalescing and Bank Conflicts ☆24 Updated 2 years ago
- KMeans clustering in Eigen. ☆25 Updated 8 years ago
- Learn OpenCL step by step. ☆132 Updated 2 years ago
- OpenMP examples ☆136 Updated 5 years ago
- Example code used in the CVPR 2015 tutorial ☆39 Updated 9 years ago
- Example of how to use CUDA with CMake >= 3.8 ☆69 Updated last year
- Large scale C++ Software development tutorials ☆76 Updated 5 years ago
- pdf ☆87 Updated 6 years ago
- CUDA by practice ☆116 Updated 4 years ago
- Header-only/compiled C++ numerical compute library. ☆29 Updated last year
- A Light-weight and Fast Template Matrix Library ☆131 Updated 11 years ago
- ☆21 Updated 7 years ago
- Source code examples from the Parallel Forall Blog ☆94 Updated 5 years ago
- A shallow fork of SuiteSparse adding build files for Visual Studio and support for ACML ☆100 Updated 9 years ago
- Skeletonide is a parallel implementation of Zhang-Suen morphological thinning algorithm written in Halide-lang. Use it for fast skeletoni… ☆12 Updated 4 years ago
- MWE for using the Eigen library in CUDA kernels ☆117 Updated 2 years ago