ROCm / HIPIFY
HIPIFY: Convert CUDA to Portable C++ Code
☆523Updated this week
Related projects ⓘ
Alternatives and complementary repositories for HIPIFY
- A collection of examples for the ROCm software stack☆166Updated this week
- CUDA Kernel Benchmarking Library☆512Updated 2 weeks ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆309Updated this week
- ☆228Updated this week
- ROCm Communication Collectives Library (RCCL)☆265Updated this week
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆363Updated 2 months ago
- Examples for HIP☆201Updated this week
- Stretching GPU performance for GEMMs and tensor contractions.☆220Updated this week
- Next generation BLAS implementation for ROCm platform☆346Updated this week
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.☆226Updated this week
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆553Updated last week
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…☆301Updated this week
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆401Updated last year
- STREAM, for lots of devices written in many programming models☆325Updated 2 months ago
- AMD's graph optimization engine.☆185Updated this week
- An implementation of BLAS using the SYCL open standard.☆258Updated last week
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime☆223Updated this week
- Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C+…☆1,381Updated this week
- Kernel Tuner☆285Updated this week
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas…☆206Updated this week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆517Updated 5 months ago
- ROCm Parallel Primitives☆161Updated this week
- ☆482Updated this week
- Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloade…☆560Updated last month
- ROCm BLAS marshalling library☆118Updated this week
- collection of benchmarks to measure basic GPU capabilities☆264Updated 4 months ago
- CUDA Core Compute Libraries☆1,246Updated this week
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific…☆122Updated this week
- ☆98Updated this week
- Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial☆179Updated this week