ICLDisco / parsec
PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core heterogeneous architectures. PaRSEC assigns computation threads to the cores, GPU accelerators, overlaps communications and computations and uses a dynamic, fully-distributed scheduler based on architectural fe…
☆48Updated this week
Related projects: ⓘ
- Training examples for SYCL☆38Updated 6 months ago
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte…☆23Updated 4 months ago
- Molecular dynamics proxy application based on Kokkos☆30Updated 2 months ago
- RAJA Performance Suite☆110Updated last week
- Kokkos Remote Spaces implements distributed Kokkos Views and related APIs for distributed parallel programming.☆42Updated 2 weeks ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆34Updated 8 months ago
- DPLASMA is a highly optimized, accelerator-aware, implementation of a dense linear algebra package for distributed heterogeneous systems…☆11Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆109Updated 2 weeks ago
- A mini-app to represent the multipole resonance representation lookup cross section algorithm.☆21Updated 10 months ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆97Updated last year
- Logger for MPI communication☆26Updated last year
- This aims to be an wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆20Updated 2 months ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆84Updated 2 months ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆21Updated last week
- Comb is a communication performance benchmarking tool.☆23Updated last year
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆52Updated last week
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code☆36Updated 2 months ago
- ☆10Updated last month
- CPE change log and release notes☆26Updated 2 weeks ago
- MiniMD Molecular Dynamics Mini-App☆47Updated last month
- Algebraic multigrid benchmark☆28Updated 2 months ago
- Highly Efficient FFT for Exascale☆35Updated 4 months ago
- ☆14Updated 2 weeks ago
- DBCSR: Distributed Block Compressed Sparse Row matrix library☆135Updated this week
- Yet Another Kernel Launcher: A simple C++ framework for performance portability and Fortran code porting☆55Updated last month
- Data parallel C++ mathematical object library☆154Updated 3 weeks ago
- The LLVM DOE Fork is a fork of upstream LLVM (https://github.com/llvm/llvm-project/) that hosts multiple DOE-funded projects. Contact in…☆22Updated this week
- Run a parallel command inside a split tmux window☆135Updated 2 years ago
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark☆69Updated 6 months ago
- DDC is a discrete domain computation library.☆32Updated this week