larsgeb / m1-gpu-cppLinks
Metal Shading Language on Apple M1's GPU for scientific C++.
☆101Updated 2 years ago
Alternatives and similar repositories for m1-gpu-cpp
Users that are interested in m1-gpu-cpp are comparing it to the libraries listed below
Sorting:
- Scientific computing with Metal in C++: Matrix multiplication example☆43Updated 3 years ago
- ☆80Updated this week
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆145Updated 2 years ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆130Updated 2 weeks ago
- Software library for FDTD of viscoelastic equation using a staggered grid arrangement with support for GPU and CPU backends☆56Updated last week
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels☆364Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆135Updated 2 weeks ago
- C++ HPC Tutorial materials☆55Updated last week
- CS infrastructure components for HPC applications☆176Updated last week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆112Updated 2 years ago
- Performance-portable geometric search library☆211Updated this week
- Examples for using SYCL on CUDA☆62Updated 2 months ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆54Updated 3 months ago
- Portable and vendor neutral framework for parallel programming on heterogeneous platforms.☆432Updated 2 weeks ago
- ☆45Updated 6 months ago
- resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI☆23Updated last year
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆95Updated 3 years ago
- QUDA is a library for performing calculations in lattice QCD on GPUs.☆328Updated last week
- Next generation library for iterative sparse solvers for ROCm platform☆89Updated this week
- LAPACK++ is a C++ wrapper around CPU and GPU LAPACK and LAPACK-like linear algebra libraries, developed as part of the SLATE project.☆71Updated last week
- DLA-Future☆80Updated this week
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆210Updated this week
- Performance-portable library for particle-based simulations☆251Updated this week
- A massively-parallel, block-sparse tensor framework written in C++☆309Updated last month
- Public repository for vol 2 of The Art of HPC: parallel programming☆91Updated 3 weeks ago
- Numerical linear algebra software package☆522Updated this week
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆61Updated last week
- Tutorials for the Kokkos C++ Performance Portability Programming Ecosystem☆350Updated last month
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆51Updated this week
- Distributed memory, MPI based SuperLU☆212Updated last week