gthparch / CuPBoP-AMD
CuPBoP-AMD is a CUDA translator that translates CUDA programs at NVVM IR level to HIP-compatible IR that can run on AMD GPUs.
☆36Updated last year
Alternatives and similar repositories for CuPBoP-AMD:
Users that are interested in CuPBoP-AMD are comparing it to the libraries listed below
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆126Updated 4 months ago
- Assembler and Decompiler for NVIDIA (Maxwell Pascal Volta Turing Ampere) GPUs.☆78Updated 2 years ago
- Dissecting NVIDIA GPU Architecture☆92Updated 2 years ago
- ☆142Updated this week
- ☆39Updated 3 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆38Updated 3 years ago
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆90Updated last month
- An extension library of WMMA API (Tensor Core API)☆96Updated 9 months ago
- rocWMMA☆110Updated last week
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆50Updated last month
- ☆96Updated last year
- ☆66Updated 6 months ago
- CUDA PTX-ISA Document 中文翻译版☆38Updated last month
- SYCL Reference Manual☆27Updated last year
- Tenstorrent MLIR compiler☆122Updated this week
- ☆33Updated 3 years ago
- PTX-EMU is a simple emulator for CUDA program.☆31Updated 2 weeks ago
- ☆95Updated last week
- amdgpu example code in hip/asm☆31Updated 3 weeks ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆134Updated last week
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆104Updated 7 years ago
- ☆44Updated 4 years ago
- IREE's PyTorch Frontend, based on Torch Dynamo.☆82Updated this week
- ☆60Updated 4 months ago
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels☆18Updated last week
- GPU Performance Advisor☆64Updated 2 years ago
- GPUOcelot: A dynamic compilation framework for PTX☆187Updated 3 months ago
- SYCL Benchmark Suite☆64Updated 2 months ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆84Updated last year
- GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated…☆53Updated 2 weeks ago