gthparch / CuPBoP-AMDLinks
CuPBoP-AMD is a CUDA translator that translates CUDA programs at NVVM IR level to HIP-compatible IR that can run on AMD GPUs.
☆37Updated last year
Alternatives and similar repositories for CuPBoP-AMD
Users that are interested in CuPBoP-AMD are comparing it to the libraries listed below
Sorting:
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆132Updated 7 months ago
- rocWMMA☆125Updated this week
- GPUOcelot: A dynamic compilation framework for PTX☆207Updated 6 months ago
- ☆42Updated 2 months ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆55Updated 5 months ago
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆112Updated 3 months ago
- ☆147Updated this week
- BGHT: High-performance static GPU hash tables.☆71Updated last month
- Dissecting NVIDIA GPU Architecture☆104Updated 3 years ago
- ☆71Updated 10 months ago
- development repository for the open earth compiler☆80Updated 4 years ago
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆136Updated last month
- An extension library of WMMA API (Tensor Core API)☆102Updated last year
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆72Updated 3 weeks ago
- Assembler and Decompiler for NVIDIA (Maxwell Pascal Volta Turing Ampere) GPUs.☆84Updated 2 years ago
- ☆106Updated last year
- amdgpu example code in hip/asm☆38Updated last week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆141Updated last week
- PTX-EMU is a simple emulator for CUDA program.☆34Updated 4 months ago
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆134Updated last year
- SYCL Benchmark Suite☆65Updated 2 months ago
- ☆62Updated 8 months ago
- ☆27Updated last year
- IREE plugin repository for the AMD AIE accelerator☆102Updated last week
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆84Updated last year
- ☆149Updated this week
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆43Updated 3 years ago
- Bandwidth test for ROCm☆65Updated this week
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆101Updated last week
- ☆54Updated 5 years ago