ROCm / rocHPL
High Performance Linpack for Next-Generation AMD HPC Accelerators
☆51Updated this week
Alternatives and similar repositories for rocHPL:
Users that are interested in rocHPL are comparing it to the libraries listed below
- HPCG benchmark based on ROCm platform☆37Updated 3 weeks ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆71Updated this week
- Advanced Profiling and Analytics for AMD Hardware☆145Updated this week
- ☆17Updated last year
- ROC profiler library. Profiling with perf-counters and derived metrics.☆141Updated this week
- ☆23Updated last week
- The LLVM DOE Fork is a fork of upstream LLVM (https://github.com/llvm/llvm-project/) that hosts multiple DOE-funded projects. Contact in…☆25Updated this week
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark☆78Updated last year
- ☆12Updated last month
- Logger for MPI communication☆26Updated last year
- Reference implementations of MLPerf™ HPC training benchmarks☆47Updated last month
- ROCm SPARSE marshalling library☆67Updated this week
- Very-Low Overhead Checkpointing System☆57Updated 2 months ago
- Next generation SPARSE implementation for ROCm platform☆119Updated this week
- ☆14Updated 4 years ago
- Next generation LAPACK implementation for ROCm platform☆99Updated this week
- TAU Performance System Public Mirror (Updated every night at midnight, USA Pacific Time)☆43Updated this week
- RAJA Performance Suite☆119Updated this week
- oneAPI Level Zero Conformance & Performance test content☆48Updated this week
- ☆35Updated last week
- RCCL Performance Benchmark Tests☆60Updated this week
- A tracing infrastructure for heterogeneous computing applications.☆31Updated this week
- MPI accelerator-integrated communication extensions☆33Updated 2 years ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆51Updated last month
- A Micro-benchmarking Tool for HPC Networks☆27Updated 2 months ago
- Comb is a communication performance benchmarking tool.☆24Updated 2 years ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆107Updated last year
- NAS Parallel Benchmarks for evaluating GPU and APIs☆23Updated last month
- Examples illustrating usage of the rocBLAS library☆14Updated 8 months ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Updated last week