BenjaminW3 / matmul
Sequential and parallel GEMM implementations with C interface + Benchmark.
☆12 · Updated 8 years ago
Alternatives and similar repositories for matmul:
Users who are interested in matmul are comparing it to the libraries listed below.
- A library for C++/Fortran computer simulations (e.g. stencil codes, mesh-free, unstructured grids, n-body & particle methods). Scales fro… ☆40 · Updated 3 years ago
- This aims to be a wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i… ☆22 · Updated 3 months ago
- Autonomic Performance Environment for eXascale (APEX) ☆42 · Updated this week
- Material for a course about the use of modern C++ for scientific computation ☆18 · Updated last month
- An OpenMP runtime implemented using HPX ☆23 · Updated 2 years ago
- Experimental MPI Wrapper for Kokkos ☆16 · Updated last month
- libSplash - Simple Parallel file output Library for Accumulating Simulation data using Hdf5 ☆15 · Updated 3 years ago
- In Situ Animation of Accelerated Computations ☆25 · Updated 8 months ago
- mirror from http://lotsofcores.com book 2, since dropbox isn't good for everyone ☆38 · Updated 8 years ago
- Structured PIC proxy app based on Cabana ☆13 · Updated last month
- Distributed View Extension for Kokkos ☆43 · Updated last month
- A neutral particle transport mini-app to study performance of sweeps on unstructured, 3D tetrahedral meshes. ☆18 · Updated 2 years ago
- Vectorised data model base and helper classes. ☆20 · Updated last week
- Implementation of AMD HIP for CPUs ☆22 · Updated 4 years ago
- Advanced MPI bindings for C++ ☆38 · Updated 11 years ago
- Header-only C++20 wrapper for MPI 4.0. ☆14 · Updated last year
- Department of Energy Standard Utility Library ☆30 · Updated 4 months ago
- ☆19 · Updated 3 years ago
- Automatically exported from code.google.com/p/patus ☆15 · Updated 9 years ago
- Performance-portable C++ code for simulating elastic shear waves in an axisymmetric domain. ☆13 · Updated 2 years ago
- Repository for collecting, curating and maintaining up to date CMake scripts. ☆9 · Updated 3 years ago
- ☆11 · Updated 3 years ago
- WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze ☆17 · Updated 5 years ago
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code ☆37 · Updated 3 weeks ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development. ☆35 · Updated 3 months ago
- A library for C++/Fortran computer simulations (e.g. stencil codes, mesh-free, unstructured grids, n-body & particle methods). Scales fro… ☆27 · Updated 3 years ago
- Library for length agnostic SIMD intrinsic support and the corresponding math operations ☆20 · Updated 3 years ago
- Multi-dimensional C++ arrays which store objects in a Struct-of-Arrays (SoA) memory layout for efficient vectorization and zero address g… ☆36 · Updated 4 years ago
- Molecular dynamics proxy application based on Kokkos ☆31 · Updated 6 months ago
- This is a set of simple programs that can be used to explore the features of a parallel platform. ☆11 · Updated this week