bsc-pm / tampi
The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the interoperability between parallel task-based programming models and MPI operations.
☆24 Updated 4 months ago
Alternatives and similar repositories for tampi:
Users who are interested in tampi are comparing it to the libraries listed below.
- Logger for MPI communication ☆26 Updated last year
- Comb is a communication performance benchmarking tool. ☆24 Updated 2 years ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037 ☆41 Updated last year
- TAU Performance System Public Mirror (Updated every night at midnight, USA Pacific Time) ☆43 Updated this week
- Distributed View Extension for Kokkos ☆45 Updated 4 months ago
- OpenMP vs Offload ☆21 Updated last year
- Very-Low Overhead Checkpointing System ☆57 Updated 3 months ago
- MiniAMR Adaptive Mesh Refinement (AMR) Mini-App ☆34 Updated 5 months ago
- Training examples for SYCL ☆40 Updated this week
- A dynamic analysis tool to detect floating-point errors in HPC applications. ☆33 Updated last week
- ☆17 Updated last year
- The LLVM DOE Fork is a fork of upstream LLVM (https://github.com/llvm/llvm-project/) that hosts multiple DOE-funded projects. Contact in… ☆25 Updated this week
- MPI accelerator-integrated communication extensions ☆33 Updated 2 years ago
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code ☆39 Updated 3 months ago
- MPI wrapper generator, for writing PMPI tool libraries ☆34 Updated 3 weeks ago
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core … ☆56 Updated 2 weeks ago
- RAJA Performance Suite ☆119 Updated this week
- Algebraic multigrid benchmark ☆33 Updated 9 months ago
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub… ☆58 Updated last week
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs. ☆32 Updated last week
- A Multi-purpose, Application-Centric, Scalable I/O Proxy Application ☆34 Updated 4 years ago
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting ☆64 Updated 3 weeks ago
- Department of Energy Standard Utility Library ☆31 Updated last month
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark ☆78 Updated last year
- Nanos6 is a runtime that implements the OmpSs-2 parallel programming model, developed by the System Tools and Advanced Runtimes (STAR) gr… ☆20 Updated 4 months ago
- A light-weight MPI profiler. ☆93 Updated 8 months ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development. ☆36 Updated last week
- Molecular dynamics proxy application based on Kokkos ☆32 Updated 9 months ago
- ☆11 Updated 3 years ago
- Official BOLT Repository ☆28 Updated 7 months ago