davidrohr / caldgemmLinks
Portable and Flexible DGEMM Library for GPUs (OpenCL, CUDA, CAL) with special support for HPL
☆17Updated 7 years ago
Alternatives and similar repositories for caldgemm
Users that are interested in caldgemm are comparing it to the libraries listed below
Sorting:
- ☆18Updated 2 years ago
- A Monte Carlo transport mini-app for studying new parallel algorithms☆18Updated last month
- Compute applications.☆25Updated 6 years ago
- Comb is a communication performance benchmarking tool.☆26Updated 2 years ago
- Unstructured mesh hydrodynamics for advanced architectures☆23Updated 2 years ago
- GPUDirect Async implementation of HPGMG-FV CUDA☆11Updated 7 years ago
- Scripts for running various benchmarks on Isambard and other systems.☆29Updated 4 years ago
- OpenMP vs Offload☆23Updated 2 years ago
- ☆11Updated 4 years ago
- Automatically exported from code.google.com/p/patus☆16Updated 10 years ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆47Updated 2 years ago
- ext_mpi_collectives☆11Updated 10 months ago
- ☆16Updated 2 months ago
- Pragmatic, Productive, and Portable Affinity for HPC☆51Updated 3 weeks ago
- ☆15Updated 5 years ago
- Data and reproducibility scripts for the UoB-HPC Performance Portability studies☆18Updated last year
- Aries Network Performance Counters Monitoring Library☆11Updated 5 years ago
- CPU and GPU tutorial examples☆13Updated 10 months ago
- Tools to run and parse MKL verbose mode☆18Updated 3 years ago
- Training examples for SYCL☆49Updated 2 months ago
- ☆19Updated 3 weeks ago
- JUPITER Benchmark Suite☆23Updated 6 months ago
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte…☆25Updated 8 months ago
- A Monte Carlo Neutron Transport Mini-App☆15Updated 6 years ago
- Oak Ridge OpenSHMEM Benchmarks☆15Updated 7 years ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆22Updated 2 years ago
- Parallel Computing -- Validation Suite: Validation engine for Exascale project benchmarks☆15Updated 3 months ago
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code☆40Updated 3 weeks ago
- ReMPI (MPI Record-and-Replay)☆40Updated last year
- MiniAMR Adaptive Mesh Refinement (AMR) Mini-App☆38Updated last year