larsgeb / m1-gpu-cpp
Metal Shading Language on Apple M1's GPU for scientific C++.
☆102 Updated last year
Alternatives and similar repositories for m1-gpu-cpp
Users who are interested in m1-gpu-cpp are comparing it to the libraries listed below.
- Scientific computing with Metal in C++: Matrix multiplication example ☆40 Updated 3 years ago
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices ☆144 Updated 2 years ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy… ☆129 Updated 4 months ago
- ☆80 Updated this week
- QUDA is a library for performing calculations in lattice QCD on GPUs. ☆328 Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools ☆133 Updated this week
- CS infrastructure components for HPC applications ☆176 Updated this week
- Examples for using SYCL on CUDA ☆62 Updated last month
- Software library for FDTD of viscoelastic equation using a staggered grid arrangement with support for GPU and CPU backends ☆56 Updated 3 weeks ago
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels ☆359 Updated this week
- CSC Summer School in High-Performance Computing ☆114 Updated 3 months ago
- Performance-portable library for particle-based simulations ☆251 Updated 2 weeks ago
- A shared-memory FFT for the Kokkos ecosystem ☆44 Updated this week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆111 Updated 2 years ago
- This is a mirror of https://gitlab.inria.fr/starpu/starpu where our development happens, but contributions are welcome here too! ☆76 Updated this week
- A website covering major HPC technologies, designed to welcome contributions. ☆77 Updated last year
- Performance-portable geometric search library ☆209 Updated 2 weeks ago
- Public repository for vol 2 of The Art of HPC: parallel programming ☆88 Updated this week
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆209 Updated 5 months ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems. ☆61 Updated this week
- Portable and vendor neutral framework for parallel programming on heterogeneous platforms. ☆430 Updated this week
- Exercises and Solutions for "Programming Your GPU with OpenMP: A Hands-On Introduction" ☆147 Updated 6 months ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support ☆55 Updated 2 months ago
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project. ☆86 Updated last month
- ScaLAPACK development repository ☆158 Updated last month
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆115 Updated this week
- An Adaptive Pencil Decomposition Library for NVIDIA GPUs ☆69 Updated last week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆95 Updated 3 years ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda ☆91 Updated this week
- DLA-Future ☆78 Updated last week