larsgeb / m1-gpu-cpp
Metal Shading Language on Apple M1's GPU for scientific C++.
☆93Updated last year
Alternatives and similar repositories for m1-gpu-cpp
Users who are interested in m1-gpu-cpp are comparing it to the libraries listed below
- Scientific computing with Metal in C++: Matrix multiplication example☆29Updated 2 years ago
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆139Updated 2 years ago
- C++ Template Linear Algebra PACKage☆43Updated last week
- ☆56Updated 2 weeks ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆55Updated 3 weeks ago
- CS infrastructure components for HPC applications☆171Updated this week
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy…☆117Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆123Updated 2 weeks ago
- LAPACK++ is a C++ wrapper around CPU and GPU LAPACK and LAPACK-like linear algebra libraries, developed as part of the SLATE project.☆64Updated this week
- Resources for the introduction to GPU programming course of the HPC-AI specialized master's (mastère spécialisé) program☆22Updated last year
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆36Updated last month
- Header-only C++20 wrapper for MPI 4.0.☆46Updated last year
- Examples for using SYCL on CUDA☆62Updated 2 months ago
- Next generation library for iterative sparse solvers for ROCm platform☆81Updated this week
- Compiler-agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA☆87Updated last week
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆53Updated 2 months ago
- Benchmarking OpenBLAS on the Apple M1☆18Updated 4 years ago
- A shared-memory FFT for the Kokkos ecosystem☆35Updated last week
- Software to support people learning OpenMP with our book ... The OpenMP Common Core: Making OpenMP Simple Again☆82Updated last year
- Performance-portable library for particle-based simulations☆239Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels☆337Updated this week
- An implementation of HIP that works on CPUs, across OSes.☆116Updated last year
- An Adaptive Pencil Decomposition Library for NVIDIA GPUs☆62Updated 3 weeks ago
- ☆39Updated 3 weeks ago
- Next generation LAPACK implementation for ROCm platform☆100Updated this week
- Examples from Programming in Parallel with CUDA☆143Updated 2 years ago
- OpenFPM: A scalable open framework for particle and particle-mesh codes on parallel computers☆19Updated 3 weeks ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- Astrophysics MHD simulation code optimized for large GPU clusters☆58Updated 4 months ago
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction☆68Updated 2 months ago