BenjaminW3 / matmul
Sequential and parallel GEMM implementations with C interface + Benchmark.
☆12Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for matmul
- Autonomic Performance Environment for eXascale (APEX)☆38Updated last week
- A neutral particle transport mini-app to study performance of sweeps on unstructured, 3D tetrahedral meshes.☆17Updated 2 years ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆34Updated last month
- This aims to be an wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆21Updated 3 weeks ago
- Aries Network Performance Counters Monitoring Library☆11Updated 3 years ago
- ☆11Updated 3 years ago
- A library for C++/Fortran computer simulations (e.g. stencil codes, mesh-free, unstructured grids, n-body & particle methods). Scales fro…☆38Updated 3 years ago
- libSplash - Simple Parallel file output Library for Accumulating Simulation data using Hdf5☆15Updated 3 years ago
- Department of Energy Standard Utility Library☆30Updated 2 months ago
- Vectorised data model base and helper classes.☆19Updated this week
- Distributed View Extension for Kokkos☆43Updated 2 months ago
- OpenMP vs Offload☆21Updated last year
- Material for a course about the use of modern C++ for scientific computation☆14Updated 5 months ago
- A mirror of cinch's internal gitlab repository.☆22Updated 2 years ago
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code☆36Updated 4 months ago
- In Situ Animation of Accelerated Computations☆25Updated 6 months ago
- ☆19Updated 3 years ago
- Structured PIC proxy app based on Cabana☆13Updated last month
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆39Updated 9 months ago
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆54Updated this week
- Multi-dimensional C++ arrays which store objects in a Struct-of-Arrays (SoA) memory layout for efficient vectorization and zero address g…☆36Updated 4 years ago
- An OpenMP runtime implemented using HPX☆23Updated 2 years ago
- A mirror of FleCSI's internal gitlab repository.☆67Updated 3 years ago
- Header-only plugin for the Google Test framework defining listener(s) emitting sensible output when testing MPI-based, distributed-memory…☆20Updated 3 years ago
- Portable HPC Containers (C++)☆48Updated this week
- ☆17Updated 9 months ago
- Advanced MPI bindings for C++☆37Updated 11 years ago
- ☆4Updated 7 months ago
- A Monte Carlo transport mini-app for studying new parallel algorithms☆17Updated 3 weeks ago
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆57Updated last week