BenjaminW3 / matmulLinks
Sequential and parallel GEMM implementations with C interface + Benchmark.
☆12Updated 9 years ago
Alternatives and similar repositories for matmul
Users that are interested in matmul are comparing it to the libraries listed below
Sorting:
- A library for C++/Fortran computer simulations (e.g. stencil codes, mesh-free, unstructured grids, n-body & particle methods). Scales fro…☆40Updated 4 years ago
- A mirror of cinch's internal gitlab repository.☆21Updated 2 years ago
- libSplash - Simple Parallel file output Library for Accumulating Simulation data using Hdf5☆16Updated 4 years ago
- A neutral particle transport mini-app to study performance of sweeps on unstructured, 3D tetrahedral meshes.☆19Updated 3 years ago
- ☆11Updated 4 years ago
- Aries Network Performance Counters Monitoring Library☆11Updated 4 years ago
- Structured PIC proxy app based on Cabana☆15Updated 2 months ago
- This is a set of simple programs that can be used to explore the features of a parallel platform.☆12Updated 3 weeks ago
- A Monte Carlo transport mini-app for studying new parallel algorithms☆17Updated 4 months ago
- Automatically exported from code.google.com/p/patus☆16Updated 10 years ago
- Logger for MPI communication☆27Updated 2 years ago
- ReMPI (MPI Record-and-Replay)☆39Updated last year
- Autonomic Performance Environment for eXascale (APEX)☆49Updated 2 months ago
- Distributed View Extension for Kokkos☆48Updated 9 months ago
- In Situ Animation of Accelerated Computations☆27Updated 3 months ago
- Experimental MPI Wrapper for Kokkos☆22Updated this week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 5 months ago
- This aims to be an wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆23Updated 11 months ago
- Department of Energy Standard Utility Library☆32Updated last month
- Vectorised data model base and helper classes.☆20Updated last week
- Experimental Linear Algebra Performance Studies☆12Updated 8 years ago
- Implementation of AMD HIP for CPUs☆23Updated 5 years ago
- An OpenMP runtime implemented using HPX☆24Updated 3 years ago
- OpenMP vs Offload☆22Updated 2 years ago
- Portable HPC Containers (C++)☆48Updated this week
- libhio is a library intended for writing data to hierarchical data store systems.☆20Updated 4 years ago
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code☆40Updated 2 months ago
- Multi-dimensional C++ arrays which store objects in a Struct-of-Arrays (SoA) memory layout for efficient vectorization and zero address g…☆36Updated 4 years ago
- MiniAMR Adaptive Mesh Refinement (AMR) Mini-App☆37Updated 10 months ago
- Dynamic Loop Self-scheduling For Load Balancing (DLS4LB) is an MPI-Based load balancing library. It is implemented in C and FORTRAN (F90)…☆16Updated 2 years ago