larsgeb / m1-gpu-cpp
Metal Shading Language on Apple M1's GPU for scientific C++.
☆89 Updated last year
Alternatives and similar repositories for m1-gpu-cpp:
Users who are interested in m1-gpu-cpp are comparing it to the libraries listed below:
- Scientific computing with Metal in C++: Matrix multiplication example ☆28 Updated 2 years ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools ☆121 Updated 2 months ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy… ☆110 Updated 2 months ago
- ☆38 Updated last month
- ☆44 Updated last month
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels ☆333 Updated this week
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices ☆135 Updated 2 years ago
- Performance-portable geometric search library ☆197 Updated this week
- CS infrastructure components for HPC applications ☆169 Updated this week
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems. ☆51 Updated last month
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆93 Updated 3 years ago
- Astrophysics program simulating the evolution of star systems based on the fast multipole method on adaptive Octrees ☆51 Updated 2 weeks ago
- A shared-memory FFT for the Kokkos ecosystem ☆31 Updated this week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development. ☆36 Updated 6 months ago
- Molecular dynamics proxy application based on Kokkos ☆32 Updated 8 months ago
- DDC is a discrete domain computation library. ☆35 Updated this week
- Next generation LAPACK implementation for ROCm platform ☆99 Updated this week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆106 Updated last year
- CSC Summer School in High-Performance Computing ☆100 Updated 2 months ago
- Performance-portable library for particle-based simulations ☆230 Updated 2 weeks ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆203 Updated 3 months ago
- DLA-Future ☆70 Updated this week
- LAPACK++ is a C++ wrapper around CPU and GPU LAPACK and LAPACK-like linear algebra libraries, developed as part of the SLATE project. ☆62 Updated 2 months ago
- Local and distributed octrees based on Morton codes with halo discovery and exchange with a 3D collision detection algorithm ☆41 Updated last month
- C++ HPC Tutorial materials ☆48 Updated 8 months ago
- Abstraction Library for Parallel Kernel Acceleration ☆372 Updated this week
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project. ☆77 Updated 3 weeks ago
- Examples for using SYCL on CUDA ☆62 Updated 3 weeks ago
- Structured Matrix Package (LBNL) ☆173 Updated 3 months ago
- Tutorials for the Kokkos C++ Performance Portability Programming Ecosystem ☆316 Updated 3 weeks ago