AMD-HPC / CoralGemm
☆12 Updated last month
Alternatives and similar repositories for CoralGemm:
Users who are interested in CoralGemm are comparing it to the libraries listed below.
- ☆17 Updated last year
- Comb is a communication performance benchmarking tool. ☆24 Updated 2 years ago
- ☆14 Updated 4 years ago
- MiniAMR Adaptive Mesh Refinement (AMR) Mini-App ☆34 Updated 4 months ago
- HPCG benchmark based on ROCm platform ☆37 Updated last week
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037 ☆40 Updated last year
- High Performance Linpack for Next-Generation AMD HPC Accelerators ☆48 Updated last month
- Department of Energy Standard Utility Library ☆31 Updated 3 weeks ago
- Compute applications. ☆24 Updated 5 years ago
- MPI accelerator-integrated communication extensions ☆32 Updated last year
- Benchmarks ☆15 Updated 5 months ago
- ext_mpi_collectives ☆10 Updated last year
- A Monte Carlo transport mini-app for studying new parallel algorithms ☆17 Updated 2 months ago
- Training examples for SYCL ☆39 Updated last month
- The LLVM DOE Fork is a fork of upstream LLVM (https://github.com/llvm/llvm-project/) that hosts multiple DOE-funded projects. Contact in… ☆25 Updated this week
- OpenMP vs Offload ☆21 Updated last year
- Oak Ridge OpenSHMEM Benchmarks ☆14 Updated 6 years ago
- Unstructured mesh hydrodynamics for advanced architectures ☆21 Updated last year
- JUPITER Benchmark Suite ☆15 Updated 7 months ago
- Scripts for running various benchmarks on Isambard and other systems. ☆28 Updated 3 years ago
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte… ☆23 Updated 4 months ago
- CPU and GPU tutorial examples ☆13 Updated 3 weeks ago
- TAU Performance System Public Mirror (Updated every night at midnight, USA Pacific Time) ☆42 Updated this week
- A tracing infrastructure for heterogeneous computing applications. ☆30 Updated this week
- Pragmatic, Productive, and Portable Affinity for HPC ☆35 Updated last week
- Header-only library of GPU-accelerated, concurrent data structures. ☆10 Updated 3 weeks ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data ☆31 Updated 4 months ago
- ☆11 Updated 3 years ago
- Distributed Communication-Optimal LU-factorization Algorithm ☆12 Updated 3 years ago
- Reusable software components for ROCm developers ☆83 Updated last week