larsgeb / m1-gpu-cpp
Metal Shading Language on Apple M1's GPU for scientific C++.
☆82 · Updated last year
Related projects
Alternatives and complementary repositories for m1-gpu-cpp
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices ☆124 · Updated last year
- Scientific computing with Metal in C++: Matrix multiplication example ☆22 · Updated 2 years ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems. ☆44 · Updated last month
- CS infrastructure components for HPC applications ☆157 · Updated this week
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy… ☆93 · Updated 3 weeks ago
- Next generation LAPACK implementation for ROCm platform ☆94 · Updated this week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development. ☆35 · Updated 2 months ago
- C++ HPC Tutorial materials ☆48 · Updated 4 months ago
- RAJA Performance Suite ☆110 · Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels ☆311 · Updated this week
- ROCm Thrust - run Thrust dependent software on AMD GPUs ☆100 · Updated this week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆100 · Updated last year
- Examples for HIP ☆200 · Updated 2 weeks ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools ☆112 · Updated 2 months ago
- Performance-portable geometric search library ☆184 · Updated last week
- Structured Matrix Package (LBNL) ☆167 · Updated 2 weeks ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces ☆106 · Updated this week
- Examples for using SYCL on CUDA ☆60 · Updated 2 weeks ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆92 · Updated 2 years ago
- A massively-parallel, block-sparse tensor framework written in C++ ☆259 · Updated this week
- Performance-portable library for particle-based simulations ☆215 · Updated last month
- Tutorials for the Kokkos C++ Performance Portability Programming Ecosystem ☆296 · Updated 2 months ago
- Next generation library for iterative sparse solvers for ROCm platform ☆76 · Updated this week
- A C++17 message passing library based on MPI ☆168 · Updated 9 months ago
- Reusable software components for ROCm developers ☆79 · Updated this week
- A flyweight in situ visualization and analysis runtime for multi-physics HPC simulations ☆198 · Updated this week
- CSC Summer School in High-Performance Computing ☆93 · Updated 4 months ago
- An Adaptive Pencil Decomposition Library for NVIDIA GPUs ☆57 · Updated last week
- Accelerated finite element flow solvers ☆147 · Updated last week
- An implementation of BLAS using the SYCL open standard. ☆259 · Updated 2 weeks ago