ROCm / ROC_SHMEM
ROC_SHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.
☆39Updated last year
Related projects ⓘ
Alternatives and complementary repositories for ROC_SHMEM
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆43Updated last month
- Advanced Profiling and Analytics for AMD Hardware☆135Updated this week
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆60Updated 6 years ago
- ☆44Updated last week
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆31Updated 3 years ago
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆63Updated this week
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆67Updated 10 months ago
- RCCL Performance Benchmark Tests☆50Updated 3 weeks ago
- ROC profiler library. Profiling with perf-counters and derived metrics.☆130Updated this week
- Chai☆42Updated 11 months ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆75Updated this week
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark☆72Updated 8 months ago
- ☆16Updated this week
- ☆41Updated 4 years ago
- tools to create performance and roofline plots from measured data☆58Updated 10 years ago
- HPCG benchmark based on ROCm platform☆35Updated 3 weeks ago
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆35Updated this week
- oneAPI Level Zero Conformance & Performance test content☆46Updated this week
- A benchmarking suite for heterogeneous systems. The primary goal of this project is to improve and update aspects of existing benchmarkin…☆40Updated 8 months ago
- GPUDirect Async support for IB Verbs☆90Updated 2 years ago
- Bandwidth test for ROCm☆47Updated 2 weeks ago
- ☆59Updated this week
- ☆47Updated 5 years ago
- A Micro-benchmarking Tool for HPC Networks☆21Updated 3 weeks ago
- ☆17Updated 10 months ago
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆45Updated last month
- SYCL Benchmark Suite☆56Updated 2 months ago
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆75Updated last week
- Next generation SPARSE implementation for ROCm platform☆116Updated this week
- A pattern-based algorithmic autotuner for graph processing on GPUs.☆30Updated last year