csc-training / hip-programming
☆11Updated this week
Alternatives and similar repositories for hip-programming:
Users that are interested in hip-programming are comparing it to the libraries listed below
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆49Updated this week
- Training examples for SYCL☆39Updated 2 months ago
- HPCG benchmark based on ROCm platform☆37Updated 2 weeks ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆21Updated last year
- ☆38Updated 3 years ago
- HiCMA: Hierarchical Computations on Manycore Architectures☆30Updated 2 years ago
- ☆35Updated this week
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆41Updated last year
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward☆24Updated 6 years ago
- ☆45Updated this week
- PanguLU: A Scalable Regular Two-Dimensional Block-Cyclic Sparse Direct Solver on Distributed Heterogeneous Systems☆36Updated 3 months ago
- Next generation library for iterative sparse solvers for ROCm platform☆78Updated this week
- This repository contains application codes and solutions for the Book on "OpenACC for Programmers - Concept & Strategies".☆34Updated 6 years ago
- Fortran interfaces for ROCm libraries☆74Updated this week
- An Adaptive Pencil Decomposition Library for NVIDIA GPUs☆60Updated last week
- ☆76Updated this week
- Examples illustrating usage of the rocBLAS library☆14Updated 7 months ago
- RAJA Performance Suite☆118Updated this week
- ALCF Computational Performance Workshop☆37Updated 2 years ago
- Next generation SPARSE implementation for ROCm platform☆119Updated this week
- Using C++ magic to launch/capture CUDA kernels and tune them with Kernel Tuner☆20Updated 11 months ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆204Updated 3 months ago
- OpenACC* to OpenMP* API assisting migration tool☆35Updated 5 months ago
- Sources for the Oak Ridge Leadership Computing Facility User Documentation☆64Updated this week
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆66Updated this week
- Next generation LAPACK implementation for ROCm platform☆99Updated this week
- Subset of BLAS routines optimized for NVIDIA GPUs☆68Updated 2 years ago
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated 2 years ago
- High-performance, GPU-aware communication library☆85Updated 2 months ago
- A mini-app to represent the multipole resonance representation lookup cross section algorithm.☆23Updated last year