JGU-HPC / parallelprogrammingbook
supplementary material/programming exercises
☆73Updated 3 years ago
Alternatives and similar repositories for parallelprogrammingbook
Users that are interested in parallelprogrammingbook are comparing it to the libraries listed below
Sorting:
- A warp-oriented dynamic hash table for GPUs☆73Updated last year
- BGHT: High-performance static GPU hash tables.☆63Updated last month
- Efficient SpGEMM on GPU using CUDA and CSR☆54Updated last year
- A Library for fast Hash Tables on GPUs☆117Updated 2 years ago
- ☆43Updated 4 years ago
- NUMA-aware multi-CPU multi-GPU data transfer benchmarks☆23Updated last year
- Learn OpenMP examples step by step☆93Updated 3 months ago
- My notes on various HPC papers.☆22Updated 2 years ago
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆88Updated last year
- ☆67Updated 11 years ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆108Updated last year
- ☆91Updated 8 years ago
- Main Book repository for the Parallel and High Performance Computing book, Manning Publications☆205Updated 2 years ago
- Examples for HPC course☆39Updated 4 years ago
- General Purpose Timing Library☆34Updated last year
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019☆55Updated 2 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆151Updated last year
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 3 years ago
- tools to create performance and roofline plots from measured data☆58Updated 10 years ago
- MagmaDNN: a simple deep learning framework in c++☆49Updated 4 years ago
- Online CUDA Occupancy Calculator☆76Updated 3 years ago
- A cross-platform CUDA/C++17 starter project with google test and google benchmark support.☆38Updated last month
- 🎃 GPU load-balancing library for regular and irregular computations.☆62Updated 11 months ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆28Updated 4 years ago
- High-performance, GPU-aware communication library☆85Updated 4 months ago
- ☆64Updated 2 years ago
- Full-speed Array of Structures access☆169Updated 2 years ago
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆131Updated 4 years ago
- TLB Benchmarks☆33Updated 7 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Updated last month