BenjaminW3 / matmul
Sequential and parallel GEMM implementations with C interface + Benchmark.
☆12Updated 8 years ago
Alternatives and similar repositories for matmul:
Users that are interested in matmul are comparing it to the libraries listed below
- This aims to be an wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆22Updated 5 months ago
- Autonomic Performance Environment for eXascale (APEX)☆44Updated this week
- A neutral particle transport mini-app to study performance of sweeps on unstructured, 3D tetrahedral meshes.☆18Updated 2 years ago
- ☆11Updated 3 years ago
- Aries Network Performance Counters Monitoring Library☆11Updated 4 years ago
- Experimental Linear Algebra Performance Studies☆12Updated 8 years ago
- Experimental MPI Wrapper for Kokkos☆19Updated 3 weeks ago
- Distributed View Extension for Kokkos☆45Updated 3 months ago
- A library for C++/Fortran computer simulations (e.g. stencil codes, mesh-free, unstructured grids, n-body & particle methods). Scales fro…☆40Updated 3 years ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆36Updated 6 months ago
- Molecular dynamics proxy application based on Kokkos☆32Updated 8 months ago
- Structured PIC proxy app based on Cabana☆14Updated last month
- Department of Energy Standard Utility Library☆31Updated 3 weeks ago
- An OpenMP runtime implemented using HPX☆23Updated 2 years ago
- Vectorised data model base and helper classes.☆19Updated 3 weeks ago
- CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as lo…☆30Updated this week
- List all available information about all SYCL devices and platforms☆15Updated 4 years ago
- MiniFE Finite Element Mini-Application☆31Updated 10 months ago
- Header-only C++20 wrapper for MPI 4.0.☆15Updated last year
- libSplash - Simple Parallel file output Library for Accumulating Simulation data using Hdf5☆16Updated 3 years ago
- Comb is a communication performance benchmarking tool.☆24Updated 2 years ago
- OpenMP vs Offload☆21Updated last year
- DARMA/vt => Virtual Transport☆36Updated this week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆84Updated last week
- Multi-dimensional C++ arrays which store objects in a Struct-of-Arrays (SoA) memory layout for efficient vectorization and zero address g…☆36Updated 4 years ago
- Material for a course about the use of modern C++ for scientific computation☆20Updated 3 months ago
- Implementation of AMD HIP for CPUs☆22Updated 4 years ago
- An alternative to Boost.MPI for a user friendly C++ interface for MPI (MPICH).☆19Updated 7 years ago
- C++ User interface for the Platform independent Library Alpaka☆38Updated 7 months ago