davidrohr / caldgemm
Portable and Flexible DGEMM Library for GPUs (OpenCL, CUDA, CAL) with special support for HPL
☆17 Updated 7 years ago
Alternatives and similar repositories for caldgemm
Users interested in caldgemm are comparing it to the libraries listed below.
- A Monte Carlo transport mini-app for studying new parallel algorithms ☆17 Updated 2 months ago
- Aries Network Performance Counters Monitoring Library ☆11 Updated 4 years ago
- Dynamic execution environments for coupled, thread-heterogeneous MPI+X applications ☆21 Updated 4 months ago
- Scripts for running various benchmarks on Isambard and other systems. ☆28 Updated 4 years ago
- A C++-based implementation of the TeaLeaf heat conduction mini-app. This implementation of TeaLeaf replicates the functionality of the ref… ☆23 Updated 11 months ago
- Compute applications. ☆24 Updated 5 years ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037 ☆44 Updated last year
- ☆11 Updated 3 years ago
- Comb is a communication performance benchmarking tool. ☆25 Updated 2 years ago
- Official BOLT Repository ☆30 Updated 10 months ago
- Oak Ridge OpenSHMEM Benchmarks ☆15 Updated 7 years ago
- Unstructured mesh hydrodynamics for advanced architectures ☆22 Updated last year
- A mirror of cinch's internal gitlab repository. ☆21 Updated 2 years ago
- GPUDirect Async implementation of HPGMG-FV CUDA ☆11 Updated 7 years ago
- Classical molecular dynamics proxy application. ☆32 Updated 5 years ago
- MPI Library Memory Consumption Utilities ☆18 Updated 2 years ago
- ReMPI (MPI Record-and-Replay) ☆39 Updated last year
- Logger for MPI communication ☆27 Updated 2 years ago
- An implementation of ARMCI using MPI one-sided communication (RMA) ☆15 Updated 9 months ago
- Tools to run and parse MKL verbose mode ☆18 Updated 3 years ago
- Dynamic Loop Self-scheduling For Load Balancing (DLS4LB) is an MPI-Based load balancing library. It is implemented in C and FORTRAN (F90)… ☆16 Updated 2 years ago
- ☆13 Updated last month
- Tools for MPI programmers ☆14 Updated 4 years ago
- A hydrodynamics mini-app to solve the compressible Euler equations in 2D, using an explicit, second-order method. ☆56 Updated 5 years ago
- OpenMP vs Offload ☆22 Updated 2 years ago
- HiCMA: Hierarchical Computations on Manycore Architectures ☆30 Updated 2 years ago
- OpenMP offload playground ☆10 Updated 7 months ago
- Open source of an IBM Optimized version of the HPCG benchmark. ☆15 Updated last year
- Scalable Integer Sort application for co-design in the exascale era ☆19 Updated 4 years ago
- Pragmatic, Productive, and Portable Affinity for HPC ☆41 Updated last month