davidrohr / caldgemmLinks
Portable and Flexible DGEMM Library for GPUs (OpenCL, CUDA, CAL) with special support for HPL
☆17Updated 7 years ago
Alternatives and similar repositories for caldgemm
Users that are interested in caldgemm are comparing it to the libraries listed below
Sorting:
- Scripts for running various benchmarks on Isambard and other systems.☆29Updated 4 years ago
- Unstructured mesh hydrodynamics for advanced architectures☆23Updated 2 years ago
- ☆11Updated 4 years ago
- Proxy app for Nek5000☆13Updated 7 years ago
- A Monte Carlo Neutron Transport Mini-App☆15Updated 6 years ago
- ext_mpi_collectives☆11Updated 8 months ago
- A Monte Carlo transport mini-app for studying new parallel algorithms☆18Updated this week
- Pragmatic, Productive, and Portable Affinity for HPC☆49Updated 3 weeks ago
- Automatically exported from code.google.com/p/patus☆16Updated 10 years ago
- OpenMP vs Offload☆23Updated 2 years ago
- JUPITER Benchmark Suite☆21Updated 5 months ago
- Compute applications.☆25Updated 6 years ago
- Tools to run and parse MKL verbose mode☆18Updated 3 years ago
- ReMPI (MPI Record-and-Replay)☆40Updated last year
- Aries Network Performance Counters Monitoring Library☆11Updated 5 years ago
- A C++based implementation of the TeaLeaf heat conduction mini-app. This implementation of TeaLeaf replicates the functionality of the ref…☆24Updated last year
- HiCMA: Hierarchical Computations on Manycore Architectures☆34Updated 2 years ago
- GPUDirect Async implementation of HPGMG-FV CUDA☆11Updated 7 years ago
- ☆18Updated last year
- OpenMP offload playground☆10Updated last year
- ☆14Updated 5 years ago
- Contains sources related to the lectures and labs for the NVIDIA OpenACC course.☆50Updated 6 years ago
- Comb is a communication performance benchmarking tool.☆25Updated 2 years ago
- MiniAMR Adaptive Mesh Refinement (AMR) Mini-App☆38Updated last year
- Parallel Computing -- Validation Suite: Validation engine for Exascale project benchmarks☆15Updated last month
- ☆16Updated last month
- Introduction to OpenACC☆30Updated 4 years ago
- This repository contains application codes and solutions for the Book on "OpenACC for Programmers - Concept & Strategies".☆34Updated 6 years ago
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code☆41Updated 2 months ago
- Dynamic Loop Self-scheduling For Load Balancing (DLS4LB) is an MPI-Based load balancing library. It is implemented in C and FORTRAN (F90)…☆16Updated 2 years ago