larsgeb / m1-gpu-cppLinks
Metal Shading Language on Apple M1's GPU for scientific C++.
☆97Updated last year
Alternatives and similar repositories for m1-gpu-cpp
Users that are interested in m1-gpu-cpp are comparing it to the libraries listed below
Sorting:
- Scientific computing with Metal in C++: Matrix multiplication example☆39Updated 2 years ago
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆145Updated 2 years ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆125Updated 3 months ago
- ☆73Updated last week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆134Updated this week
- C++ HPC Tutorial materials☆55Updated last year
- A shared-memory FFT for the Kokkos ecosystem☆43Updated this week
- CS infrastructure components for HPC applications☆173Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels☆352Updated last week
- An Adaptive Pencil Decomposition Library for NVIDIA GPUs☆69Updated 2 weeks ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆94Updated 3 years ago
- LAPACK++ is a C++ wrapper around CPU and GPU LAPACK and LAPACK-like linear algebra libraries, developed as part of the SLATE project.☆67Updated 3 months ago
- C++ Template Linear Algebra PACKage☆50Updated last week
- Exercises and Solutions for "Programming Your GPU with OpenMP: A Hands-On Introduction"☆146Updated 5 months ago
- Examples for using SYCL on CUDA☆62Updated 2 months ago
- resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI☆23Updated last year
- Performance-portable geometric search library☆209Updated 2 weeks ago
- Astrophysics program simulating the evolution of star systems based on the fast multipole method on adaptive Octrees☆53Updated last month
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆60Updated this week
- Next generation LAPACK implementation for ROCm platform☆110Updated this week
- Structured Matrix Package (LBNL)☆176Updated last week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆90Updated 2 weeks ago
- ☆44Updated 4 months ago
- Header-only C++20 wrapper for MPI 4.0.☆47Updated last year
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆84Updated 2 months ago
- Portable and vendor neutral framework for parallel programming on heterogeneous platforms.☆431Updated last week
- CSC Summer School in High-Performance Computing☆113Updated last month
- Software to support people learning OpenMP with our book ... The OpenMP Common Core: Making OpenMP Simple Again☆83Updated last year
- QUDA is a library for performing calculations in lattice QCD on GPUs.☆332Updated this week
- Software library for FDTD of viscoelastic equation using a staggered grid arrangement with support for GPU and CPU backends☆55Updated this week