davidrohr / caldgemmLinks
Portable and Flexible DGEMM Library for GPUs (OpenCL, CUDA, CAL) with special support for HPL
☆17Updated 7 years ago
Alternatives and similar repositories for caldgemm
Users that are interested in caldgemm are comparing it to the libraries listed below
Sorting:
- Scripts for running various benchmarks on Isambard and other systems.☆28Updated 4 years ago
- OpenMP vs Offload☆22Updated 2 years ago
- ☆18Updated last year
- ☆11Updated 4 years ago
- ☆14Updated 4 years ago
- Comb is a communication performance benchmarking tool.☆25Updated 2 years ago
- ext_mpi_collectives☆11Updated 6 months ago
- Pragmatic, Productive, and Portable Affinity for HPC☆48Updated this week
- ☆17Updated last week
- A Monte Carlo transport mini-app for studying new parallel algorithms☆17Updated 4 months ago
- Compute applications.☆25Updated 5 years ago
- GPUDirect Async implementation of HPGMG-FV CUDA☆11Updated 7 years ago
- Tools to run and parse MKL verbose mode☆18Updated 3 years ago
- A Monte Carlo Neutron Transport Mini-App☆15Updated 6 years ago
- Unstructured mesh hydrodynamics for advanced architectures☆23Updated 2 years ago
- A C++based implementation of the TeaLeaf heat conduction mini-app. This implementation of TeaLeaf replicates the functionality of the ref…☆23Updated last year
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆46Updated last year
- ☆14Updated last week
- Logger for MPI communication☆27Updated 2 years ago
- ☆10Updated 6 months ago
- Automatically exported from code.google.com/p/patus☆16Updated 10 years ago
- JUPITER Benchmark Suite☆20Updated 2 months ago
- Aries Network Performance Counters Monitoring Library☆11Updated 4 years ago
- Data and reproducibility scripts for the UoB-HPC Performance Portability studies☆17Updated last year
- Scripts to build AMD ROCm from source.☆16Updated 11 months ago
- HiCMA: Hierarchical Computations on Manycore Architectures☆32Updated 2 years ago
- Parallel Computing -- Validation Suite: Validation engine for Exascale project benchmarks☆15Updated 2 months ago
- Molecular dynamics proxy application based on Kokkos☆33Updated last year
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code☆40Updated 3 months ago
- NAS Parallel Benchmarks for evaluating GPU and APIs☆27Updated last week