amd / UIF
☆58 · Updated last year
Alternatives and similar repositories for UIF:
Users interested in UIF are comparing it to the libraries listed below:
- rocWMMA ☆100 · Updated this week
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen) ☆34 · Updated 4 months ago
- The Riallto Open Source Project from AMD ☆71 · Updated 3 months ago
- IREE plugin repository for the AMD AIE accelerator ☆79 · Updated this week
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators ☆74 · Updated last year
- hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditiona… ☆79 · Updated this week
- ☆81 · Updated this week
- Stretching GPU performance for GEMMs and tensor contractions. ☆233 · Updated this week
- ☆60 · Updated 2 months ago
- ☆137 · Updated this week
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high… ☆67 · Updated this week
- ☆89 · Updated this week
- ☆44 · Updated 3 months ago
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs ☆79 · Updated this week
- ROCm SPARSE marshalling library ☆67 · Updated this week
- Test suite for probing the numerical behavior of NVIDIA tensor cores ☆37 · Updated 6 months ago
- Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators ☆86 · Updated 4 months ago
- Fork of LLVM to support AMD AIEngine processors ☆123 · Updated this week
- Bandwidth test for ROCm ☆54 · Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain. ☆130 · Updated this week
- OpenAI Triton backend for Intel® GPUs ☆165 · Updated this week
- GPTPU for SC 2021 ☆51 · Updated last year
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform. ☆50 · Updated this week
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime ☆235 · Updated this week
- ROC profiler library. Profiling with perf-counters and derived metrics. ☆135 · Updated this week
- AMD's graph optimization engine. ☆208 · Updated this week
- Development repository for the Triton language and compiler ☆108 · Updated this week
- An MLIR-based toolchain for AMD AI Engine-enabled devices. ☆335 · Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators ☆350 · Updated this week
- amdgpu example code in hip/asm ☆28 · Updated last week