larsgeb / m1-gpu-cppLinks
Metal Shading Language on Apple M1's GPU for scientific C++.
☆93Updated last year
Alternatives and similar repositories for m1-gpu-cpp
Users that are interested in m1-gpu-cpp are comparing it to the libraries listed below
Sorting:
- Scientific computing with Metal in C++: Matrix multiplication example☆31Updated 2 years ago
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆144Updated 2 years ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆120Updated 3 weeks ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆56Updated 2 months ago
- C++ Template Linear Algebra PACKage☆46Updated last week
- ☆42Updated 2 months ago
- A shared-memory FFT for the Kokkos ecosystem☆37Updated last week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆130Updated last week
- N-Ways to Multi-GPU Programming☆34Updated 2 years ago
- Next generation library for iterative sparse solvers for ROCm platform☆81Updated this week
- Next generation LAPACK implementation for ROCm platform☆103Updated last week
- resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI☆21Updated last year
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆54Updated 3 months ago
- LAPACK++ is a C++ wrapper around CPU and GPU LAPACK and LAPACK-like linear algebra libraries, developed as part of the SLATE project.☆65Updated last month
- CS infrastructure components for HPC applications☆172Updated this week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆87Updated last week
- An Adaptive Pencil Decomposition Library for NVIDIA GPUs☆63Updated 2 months ago
- Distributed View Extension for Kokkos☆46Updated 6 months ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆47Updated this week
- ☆60Updated last month
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 2 months ago
- Generate simple index ranges in C++ and CUDA C++☆39Updated 2 years ago
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆68Updated 3 weeks ago
- ☆29Updated 2 weeks ago
- Header-only C++20 wrapper for MPI 4.0.☆47Updated last year
- SYCL materials for ENCCS workshop☆26Updated 2 years ago
- Examples for using SYCL on CUDA☆62Updated 2 weeks ago
- Performance-portable geometric search library☆207Updated last week
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆79Updated last week
- Molecular dynamics proxy application based on Kokkos☆33Updated 11 months ago