ROCm / amd_matrix_instruction_calculator
A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators
☆73Updated last year
Alternatives and similar repositories for amd_matrix_instruction_calculator:
Users that are interested in amd_matrix_instruction_calculator are comparing it to the libraries listed below
- ☆134Updated this week
- rocWMMA☆100Updated this week
- amdgpu example code in hip/asm☆25Updated 3 weeks ago
- Advanced Profiling and Analytics for AMD Hardware☆139Updated this week
- An extension library of WMMA API (Tensor Core API)☆87Updated 6 months ago
- Dissecting NVIDIA GPU Architecture☆84Updated 2 years ago
- ☆60Updated last month
- ☆84Updated 9 months ago
- ☆34Updated last year
- Stretching GPU performance for GEMMs and tensor contractions.☆231Updated this week
- collection of benchmarks to measure basic GPU capabilities☆287Updated 3 weeks ago
- Next generation SPARSE implementation for ROCm platform☆118Updated this week
- ROCm Parallel Primitives☆169Updated this week
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆104Updated 7 years ago
- AMD lab notes with code examples to demonstrate use of AMD GPUs☆94Updated 7 months ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆48Updated this week
- Bandwidth test for ROCm☆53Updated this week
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆79Updated this week
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)☆34Updated 4 months ago
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆37Updated 6 months ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆104Updated this week
- Assembler for NVIDIA Volta and Turing GPUs☆204Updated 3 years ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆129Updated this week
- ☆19Updated this week
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime☆234Updated this week
- ☆40Updated 4 years ago
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific…☆129Updated this week
- ROC profiler library. Profiling with perf-counters and derived metrics.☆133Updated this week
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆216Updated 2 weeks ago
- SYCL Benchmark Suite☆60Updated 4 months ago