reger-men / HPL_GPULinks
High-Performance Linpack Benchmark adopted version for GPU backend
☆11Updated 2 years ago
Alternatives and similar repositories for HPL_GPU
Users that are interested in HPL_GPU are comparing it to the libraries listed below
Sorting:
- An HPL-AI implementation for Fugaku☆21Updated 3 years ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆54Updated 2 weeks ago
- Pragmatic, Productive, and Portable Affinity for HPC☆40Updated 3 weeks ago
- ☆13Updated last month
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Updated 2 months ago
- Using C++ magic to launch/capture CUDA kernels and tune them with Kernel Tuner☆20Updated last year
- HPCG benchmark based on ROCm platform☆37Updated last week
- RCCL Performance Benchmark Tests☆67Updated 2 weeks ago
- Compute applications.☆24Updated 5 years ago
- Bandwidth test for ROCm☆56Updated 2 weeks ago
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆83Updated last week
- ☆44Updated 4 years ago
- HCC Sample Applications☆13Updated 8 years ago
- ☆14Updated 4 years ago
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆39Updated last week
- ROC profiler library. Profiling with perf-counters and derived metrics.☆147Updated last week
- ☆22Updated 2 years ago
- COCCL: Compression and precision co-aware collective communication library☆22Updated 2 months ago
- A sparse BLAS lib supporting multiple backends☆43Updated 3 months ago
- ☆17Updated 3 years ago
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward☆24Updated 7 years ago
- GPU Performance Advisor☆65Updated 2 years ago
- The LLVM DOE Fork is a fork of upstream LLVM (https://github.com/llvm/llvm-project/) that hosts multiple DOE-funded projects. Contact in…☆25Updated this week
- Get started with your NVIDIA Arm HPC Developers Kit!☆33Updated 2 years ago
- ☆36Updated last week
- ☆18Updated last year
- Slides and exercises for persistent memory programming tutorial☆13Updated 2 years ago
- ☆15Updated last month
- Dissecting NVIDIA GPU Architecture☆95Updated 2 years ago
- A hierarchical collective communications library with portable optimizations☆35Updated 5 months ago