gthparch / CuPBoP-AMDLinks

CuPBoP-AMD is a CUDA translator that translates CUDA programs at NVVM IR level to HIP-compatible IR that can run on AMD GPUs.

☆37

Alternatives and similar repositories for CuPBoP-AMD

Users that are interested in CuPBoP-AMD are comparing it to the libraries listed below

Sorting:

cupbop / CuPBoP
A framework that support executing unmodified CUDA source code on non-NVIDIA devices.
☆132Updated 7 months ago
ROCm / rocWMMA
rocWMMA
☆121Updated this week
gpuocelot / gpuocelot
GPUOcelot: A dynamic compilation framework for PTX
☆207Updated 5 months ago
ROCm / amd_matrix_instruction_calculator
A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators
☆110Updated 2 months ago
sjfeng1999 / gpu-arch-microbenchmark
Dissecting NVIDIA GPU Architecture
☆103Updated 3 years ago
passlab / CUDAMicroBench
☆42Updated last month
0xD0GF00D / DocumentSASS
Unofficial description of the CUDA assembly (SASS) instruction sets.
☆132Updated 2 weeks ago
ProjectPhysX / PTXprofiler
A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.
☆55Updated 4 months ago
accel-sim / gpu-app-collection
A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.
☆71Updated this week
QianyanTech / NBAssembler
Assembler and Decompiler for NVIDIA (Maxwell Pascal Volta Turing Ampere) GPUs.
☆82Updated 2 years ago
spcl / open-earth-compiler
development repository for the open earth compiler
☆80Updated 4 years ago
ROCm / rocMLIR
☆148Updated this week
mmperf / mmperf
MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.
☆134Updated last year
OpenGPGPU / opengpgpu
☆70Updated 9 months ago
carlushuang / gcnasm
amdgpu example code in hip/asm
☆36Updated last week
Xilinx / llvm-aie
Fork of LLVM to support AMD AIEngine processors
☆154Updated this week
intel / mlir-extensions
Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.
☆138Updated this week
sunlex0717 / DissectingTensorCores
☆106Updated last year
owensgroup / BGHT
BGHT: High-performance static GPU hash tables.
☆70Updated last month
gty111 / PTX-EMU
PTX-EMU is a simple emulator for CUDA program.
☆34Updated 3 months ago
wmmae / wmma_extension
An extension library of WMMA API (Tensor Core API)
☆99Updated last year
amd / amd-lab-notes
AMD lab notes with code examples to demonstrate use of AMD GPUs
☆100Updated last year
nod-ai / iree-amd-aie
IREE plugin repository for the AMD AIE accelerator
☆100Updated this week
SunsetQuest / CudaPAD
CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.
☆119Updated 2 years ago
ekondis / gpumembench
A GPU benchmark suite for assessing on-chip GPU memory bandwidth
☆106Updated 7 years ago
unisa-hpc / sycl-bench
SYCL Benchmark Suite
☆65Updated last month
Xilinx / mlir-air
☆103Updated last week
CHIP-SPV / chipStar
chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.
☆295Updated this week
intel / xetla
☆62Updated 7 months ago
daadaada / turingas
Assembler for NVIDIA Volta and Turing GPUs
☆226Updated 3 years ago