ROCm / rocSHMEM
ROC_SHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.
☆41Updated last year
Related projects ⓘ
Alternatives and complementary repositories for rocSHMEM
- Advanced Profiling and Analytics for AMD Hardware☆138Updated this week
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆44Updated last month
- HPCG benchmark based on ROCm platform☆35Updated this week
- ☆16Updated this week
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆60Updated 6 years ago
- ☆41Updated 4 years ago
- ROCm SPARSE marshalling library☆69Updated this week
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆31Updated 3 years ago
- RCCL Performance Benchmark Tests☆51Updated last month
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆36Updated this week
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆68Updated 10 months ago
- ROC profiler library. Profiling with perf-counters and derived metrics.☆132Updated this week
- ☆47Updated 5 years ago
- A tracing infrastructure for heterogeneous computing applications.☆25Updated last week
- oneAPI Level Zero Conformance & Performance test content☆47Updated this week
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆63Updated last week
- Chai☆42Updated 11 months ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆101Updated this week
- Reusable software components for ROCm developers☆79Updated this week
- ☆17Updated 10 months ago
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆75Updated last week
- ☆59Updated this week
- Next generation LAPACK implementation for ROCm platform☆95Updated this week
- ☆20Updated last year
- Next generation SPARSE implementation for ROCm platform☆117Updated this week
- ☆44Updated this week
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)☆34Updated 2 months ago
- ROCm Parallel Primitives☆163Updated this week
- 🎃 GPU load-balancing library for regular and irregular computations.☆58Updated 5 months ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆99Updated 7 years ago