ROCm / libhipcxx
The C++ Standard Library for your entire system.
☆15Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for libhipcxx
- ☆17Updated 9 months ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆34Updated last month
- Experimental MPI Wrapper for Kokkos☆16Updated last week
- Department of Energy Standard Utility Library☆30Updated 2 months ago
- Distributed View Extension for Kokkos☆43Updated 2 months ago
- Comb is a communication performance benchmarking tool.☆23Updated last year
- A tracing infrastructure for heterogeneous computing applications.☆23Updated this week
- Public proposals, extensions, information and materials from the SYCL working group☆14Updated 9 months ago
- Synchronous, single-threaded, library-only SYCL implementation for debugging and verification.☆27Updated last month
- ☆14Updated 4 years ago
- Advanced Profiling and Analytics for AMD Hardware☆135Updated this week
- Vectorised data model base and helper classes.☆19Updated this week
- Training examples for SYCL☆38Updated 2 weeks ago
- HPCG benchmark based on ROCm platform☆35Updated 2 weeks ago
- Autonomic Performance Environment for eXascale (APEX)☆38Updated 2 weeks ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆43Updated 3 weeks ago
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆57Updated 2 weeks ago
- SYCL Benchmark Suite☆56Updated 2 months ago
- OpenMP vs Offload☆21Updated last year
- Reusable software components for ROCm developers☆79Updated this week
- This aims to be an wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆21Updated last month
- A shared-memory FFT for the Kokkos ecosystem☆24Updated last week
- ☆11Updated 4 months ago
- Implementation of AMD HIP for CPUs☆22Updated 4 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆22Updated 2 months ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆99Updated this week
- Header-only C++20 wrapper for MPI 4.0.☆14Updated last year
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆92Updated 2 years ago
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆46Updated last week
- RAJA Performance Suite☆110Updated last week