ROCm / HIPIFY
HIPIFY: Convert CUDA to Portable C++ Code
☆552 · Updated this week
Alternatives and similar repositories for HIPIFY:
Users who are interested in HIPIFY are comparing it to the repositories listed below.
- CUDA Kernel Benchmarking Library ☆561 · Updated 3 months ago
- Examples for HIP ☆202 · Updated 2 months ago
- Stretching GPU performance for GEMMs and tensor contractions. ☆233 · Updated this week
- Next generation BLAS implementation for ROCm platform ☆359 · Updated this week
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP) ☆382 · Updated last month
- AMD's graph optimization engine. ☆208 · Updated this week
- ☆250 · Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators ☆349 · Updated this week
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs. ☆251 · Updated this week
- A collection of examples for the ROCm software stack ☆186 · Updated this week
- ROCm BLAS marshalling library ☆131 · Updated last week
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific… ☆135 · Updated this week
- ROCm Communication Collectives Library (RCCL) ☆297 · Updated this week
- An unofficial cuda assembler, for all generations of SASS, hopefully :) ☆426 · Updated last year
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou… ☆347 · Updated this week
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime ☆235 · Updated this week
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆263 · Updated last month
- ☆117 · Updated this week
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas… ☆211 · Updated this week
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster ☆610 · Updated 3 months ago
- ☆515 · Updated this week
- An implementation of HIP that works on CPUs, across OSes. ☆115 · Updated 11 months ago
- CUDA Core Compute Libraries ☆1,468 · Updated this week
- collection of benchmarks to measure basic GPU capabilities ☆296 · Updated last week
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform ☆434 · Updated 4 years ago
- Next generation FFT implementation for ROCm ☆188 · Updated this week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC). ☆521 · Updated this week
- GPUOcelot: A dynamic compilation framework for PTX ☆166 · Updated last week
- OpenAI Triton backend for Intel® GPUs ☆165 · Updated this week
- Kernel Tuner ☆311 · Updated last week