JGU-HPC / parallelprogrammingbookLinks
supplementary material/programming exercises
☆74Updated 4 years ago
Alternatives and similar repositories for parallelprogrammingbook
Users that are interested in parallelprogrammingbook are comparing it to the libraries listed below
Sorting:
- A Library for fast Hash Tables on GPUs☆132Updated 3 months ago
- tools to create performance and roofline plots from measured data☆60Updated 11 years ago
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆95Updated 2 years ago
- A warp-oriented dynamic hash table for GPUs☆76Updated 2 years ago
- Fast and full-featured Matrix Market I/O library for C++, Python, and R☆86Updated last year
- BGHT: High-performance static GPU hash tables.☆71Updated 6 months ago
- Source code examples from the Parallel Forall Blog☆96Updated 6 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆158Updated 2 years ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆95Updated 4 years ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆80Updated 5 months ago
- ☆49Updated 5 years ago
- Scalable High-performance Algorithms and Data-structures☆135Updated last month
- ☆97Updated 8 years ago
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri…☆33Updated this week
- 🎃 GPU load-balancing library for regular and irregular computations.☆64Updated 4 months ago
- ☆71Updated 11 years ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆76Updated 2 years ago
- Parallel Algorithm Scheduling Library☆106Updated 8 years ago
- ☆273Updated last week
- Main Book repository for the Parallel and High Performance Computing book, Manning Publications☆223Updated 3 years ago
- Efficient SpGEMM on GPU using CUDA and CSR☆59Updated 2 years ago
- Examples for using SYCL on CUDA☆62Updated 4 months ago
- CUSP : A C++ Templated Sparse Matrix Library☆421Updated 5 months ago
- Stencil Probe - a stencil microbenchmark☆30Updated 13 years ago
- MagmaDNN: a simple deep learning framework in c++☆51Updated 5 years ago
- Kernel Tuning Toolkit☆66Updated 2 months ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆114Updated 2 years ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆212Updated 2 weeks ago
- MPI+OpenMP implementation of the first phase of Louvain method for Graph Community Detection☆25Updated last year
- A GPU accelerated error-bounded lossy compression for scientific data.☆94Updated 3 weeks ago