larsgeb / m1-gpu-cppLinks
Metal Shading Language on Apple M1's GPU for scientific C++.
☆106Updated 2 years ago
Alternatives and similar repositories for m1-gpu-cpp
Users that are interested in m1-gpu-cpp are comparing it to the libraries listed below
Sorting:
- Scientific computing with Metal in C++: Matrix multiplication example☆46Updated 3 years ago
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆149Updated 3 years ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆131Updated 3 months ago
- ☆101Updated this week
- Software library for FDTD of viscoelastic equation using a staggered grid arrangement with support for GPU and CPU backends☆58Updated last week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆138Updated 2 weeks ago
- CS infrastructure components for HPC applications☆181Updated last week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆95Updated 4 years ago
- Examples for using SYCL on CUDA☆62Updated 4 months ago
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels☆372Updated this week
- Next generation library for iterative sparse solvers for ROCm platform☆95Updated last week
- Structured Matrix Package (LBNL)☆185Updated 4 months ago
- Software to support people learning OpenMP with our book ... The OpenMP Common Core: Making OpenMP Simple Again☆83Updated 2 years ago
- C++ HPC Tutorial materials☆54Updated 3 months ago
- Exercises and Solutions for "Programming Your GPU with OpenMP: A Hands-On Introduction"☆152Updated 10 months ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆114Updated 2 years ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆97Updated last month
- LAPACK++ is a C++ wrapper around CPU and GPU LAPACK and LAPACK-like linear algebra libraries, developed as part of the SLATE project.☆75Updated 3 months ago
- Performance-portable geometric search library☆219Updated 2 weeks ago
- DLA-Future☆82Updated 2 months ago
- resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI☆23Updated 2 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆114Updated last week
- Portable and vendor neutral framework for parallel programming on heterogeneous platforms.☆434Updated 2 months ago
- A shared-memory FFT for the Kokkos ecosystem☆46Updated this week
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆55Updated 6 months ago
- Header-only C++20 wrapper for MPI 4.0.☆47Updated 2 years ago
- ScaLAPACK development repository☆160Updated last week
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆92Updated 3 months ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆80Updated 5 months ago
- Running linear algebra as fast as possible on Apple silicon☆28Updated 2 years ago