bsc-pm / tampiLinks
The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the interoperability between parallel task-based programming models and MPI operations
☆24Updated last month
Alternatives and similar repositories for tampi
Users that are interested in tampi are comparing it to the libraries listed below
Sorting:
- Logger for MPI communication☆27Updated 2 years ago
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core …☆65Updated last month
- A dynamic analysis tool to detect floating-point errors in HPC applications.☆36Updated this week
- Comb is a communication performance benchmarking tool.☆25Updated 2 years ago
- Pragmatic, Productive, and Portable Affinity for HPC☆41Updated last month
- Very-Low Overhead Checkpointing System☆58Updated 6 months ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆44Updated last year
- MPI accelerator-integrated communication extensions☆36Updated 2 years ago
- ☆17Updated this week
- The LLVM DOE Fork is a fork of upstream LLVM (https://github.com/llvm/llvm-project/) that hosts multiple DOE-funded projects. Contact in…☆25Updated last month
- TAU Performance System Public Mirror (Updated every night at midnight, USA Pacific Time)☆48Updated this week
- ☆18Updated last year
- Official BOLT Repository☆30Updated 10 months ago
- The ultimate memory bandwidth benchmark☆50Updated 5 months ago
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark☆82Updated last year
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆59Updated 2 weeks ago
- MPI benchmark to test and measure collective performance☆51Updated 4 years ago
- HiCMA: Hierarchical Computations on Manycore Architectures☆30Updated 2 years ago
- A Micro-benchmarking Tool for HPC Networks☆30Updated last week
- OpenMP vs Offload☆22Updated 2 years ago
- RAJA Performance Suite☆117Updated this week
- MiniAMR Adaptive Mesh Refinement (AMR) Mini-App☆36Updated 8 months ago
- Nanos6 is a runtime that implements the OmpSs-2 parallel programming model, developed by the System Tools and Advanced Runtimes (STAR) gr…☆21Updated last month
- A light-weight MPI profiler.☆95Updated 11 months ago
- Distributed View Extension for Kokkos☆47Updated 7 months ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆33Updated 3 months ago
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆71Updated last week
- Instrumentation framework to generate execution traces of the most used parallel runtimes.☆70Updated last week
- Integrated Performance Monitoring for High Performance Computing☆89Updated 3 years ago
- Intermediate MPI lesson☆28Updated 2 years ago