larsgeb / m1-gpu-cppLinks
Metal Shading Language on Apple M1's GPU for scientific C++.
☆104Updated 2 years ago
Alternatives and similar repositories for m1-gpu-cpp
Users that are interested in m1-gpu-cpp are comparing it to the libraries listed below
Sorting:
- Scientific computing with Metal in C++: Matrix multiplication example☆44Updated 3 years ago
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆149Updated 3 years ago
- CS infrastructure components for HPC applications☆179Updated this week
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆131Updated last month
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆136Updated last week
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels☆367Updated last week
- ☆90Updated last week
- Performance-portable geometric search library☆220Updated last week
- C++ HPC Tutorial materials☆54Updated last month
- Examples for using SYCL on CUDA☆62Updated 3 months ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆113Updated 2 years ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆95Updated 2 weeks ago
- Next generation library for iterative sparse solvers for ROCm platform☆89Updated last week
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆53Updated this week
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆64Updated last month
- clad -- automatic differentiation for C/C++☆378Updated last week
- Portable and vendor neutral framework for parallel programming on heterogeneous platforms.☆432Updated last month
- Performance-portable library for particle-based simulations☆255Updated last month
- A shared-memory FFT for the Kokkos ecosystem☆45Updated last week
- DLA-Future☆81Updated last month
- CSC Summer School in High-Performance Computing☆118Updated this week
- RAJA Performance Suite☆125Updated this week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆36Updated last month
- Abstraction Library for Parallel Kernel Acceleration☆397Updated this week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆95Updated 4 years ago
- ☆47Updated last month
- Astrophysics program simulating the evolution of star systems based on the fast multipole method on adaptive Octrees☆53Updated last month
- Smith is a high order nonlinear thermomechanical simulation code☆218Updated this week
- An implementation of HIP that works on CPUs, across OSes.☆131Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆115Updated last week