CRobeck / instrument-amdgpu-kernelsLinks
LLVM/MLIR based compiler instrumentation of AMD GPU kernels
☆18Updated 2 months ago
Alternatives and similar repositories for instrument-amdgpu-kernels
Users that are interested in instrument-amdgpu-kernels are comparing it to the libraries listed below
Sorting:
- ☆148Updated this week
- Dissecting NVIDIA GPU Architecture☆99Updated 3 years ago
- ☆64Updated 6 years ago
- ☆52Updated 5 years ago
- ☆38Updated 3 years ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆137Updated last week
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆107Updated last month
- development repository for the open earth compiler☆80Updated 4 years ago
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆69Updated last week
- TPP experimentation on MLIR for linear algebra☆132Updated last week
- rocWMMA☆118Updated last week
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆91Updated this week
- An extension library of WMMA API (Tensor Core API)☆99Updated last year
- 🎃 GPU load-balancing library for regular and irregular computations.☆62Updated last year
- ☆44Updated 4 years ago
- GPU Performance Advisor☆65Updated 2 years ago
- ☆102Updated last year
- A Top-Down Profiler for GPU Applications☆20Updated last year
- Advanced Profiling and Analytics for AMD Hardware☆159Updated this week
- IREE plugin repository for the AMD AIE accelerator☆98Updated this week
- Performance Prediction Toolkit for GPUs☆37Updated 3 years ago
- ☆260Updated last month
- An MLIR-based toy DL compiler for TVM Relay.☆58Updated 2 years ago
- ☆28Updated last week
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆131Updated 6 months ago
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆32Updated 4 years ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆66Updated 6 years ago
- MLIR Sample dialect☆123Updated 4 months ago
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆105Updated 4 months ago
- ☆247Updated last month