BenjaminW3 / matmulLinks
Sequential and parallel GEMM implementations with C interface + Benchmark.
☆12Updated 9 years ago
Alternatives and similar repositories for matmul
Users that are interested in matmul are comparing it to the libraries listed below
Sorting:
- A library for C++/Fortran computer simulations (e.g. stencil codes, mesh-free, unstructured grids, n-body & particle methods). Scales fro…☆40Updated 4 years ago
- libSplash - Simple Parallel file output Library for Accumulating Simulation data using Hdf5☆16Updated 4 years ago
- Distributed View Extension for Kokkos☆49Updated last year
- Vectorised data model base and helper classes.☆20Updated last week
- A mirror of cinch's internal gitlab repository.☆21Updated 3 years ago
- Portable HPC Containers (C++)☆49Updated last week
- This aims to be an wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆23Updated last year
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆37Updated last week
- Autonomic Performance Environment for eXascale (APEX)☆50Updated 6 months ago
- MiniFE Finite Element Mini-Application☆38Updated last year
- Implementation of AMD HIP for CPUs☆22Updated 5 years ago
- Automatically exported from code.google.com/p/patus☆16Updated 10 years ago
- ☆11Updated 4 years ago
- A mirror of FleCSI's internal gitlab repository.☆68Updated 4 years ago
- Department of Energy Standard Utility Library☆33Updated last week
- A Monte Carlo transport mini-app for studying new parallel algorithms☆18Updated last month
- This is a set of simple programs that can be used to explore the features of a parallel platform.☆12Updated 3 months ago
- In Situ Animation of Accelerated Computations☆29Updated 7 months ago
- Aries Network Performance Counters Monitoring Library☆11Updated 5 years ago
- A neutral particle transport mini-app to study performance of sweeps on unstructured, 3D tetrahedral meshes.☆19Updated 3 years ago
- I/O Mini-apps☆19Updated 4 years ago
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code☆40Updated 3 weeks ago
- OpenMP vs Offload☆23Updated 2 years ago
- mallocMC: Memory Allocator for Many Core Architectures☆58Updated last week
- ☆36Updated 2 weeks ago
- Comb is a communication performance benchmarking tool.☆26Updated 2 years ago
- Structured PIC proxy app based on Cabana☆15Updated 7 months ago
- ReMPI (MPI Record-and-Replay)☆40Updated last year
- GPI-2☆57Updated 2 months ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆111Updated this week