alucas / StarPU
StarPU Runtime system
☆16Updated 14 years ago
Alternatives and similar repositories for StarPU:
Users that are interested in StarPU are comparing it to the libraries listed below
- A task benchmark☆41Updated 6 months ago
- TTG: Template Task Graph C++ API☆18Updated last week
- Barcelona OpenMP Task Suite is a collection of applications that allow to test OpenMP tasking implementations and compare its behaviour u…☆44Updated 5 years ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆105Updated last year
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte…☆23Updated 3 months ago
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward☆21Updated 6 years ago
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri…☆25Updated 3 weeks ago
- Loop Kernel Analysis and Performance Modeling Toolkit☆91Updated 5 months ago
- A dynamic analysis tool to detect floating-point errors in HPC applications.☆33Updated 2 years ago
- Autonomic Performance Environment for eXascale (APEX)☆43Updated last week
- pLiner is a framework that helps programmers identify locations in the source of numerical code that are highly affected by compiler opti…☆17Updated last year
- MPI accelerator-integrated communication extensions☆32Updated last year
- Scalable GPU Kernel Fission/Fusion Transformation for Memory-Bound Kernels☆13Updated 9 years ago
- Parallel Tensor Infrastructure (ParTI!)☆28Updated 4 years ago
- A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels☆18Updated 9 years ago
- Global Memory and Threading runtime system☆23Updated 9 months ago
- RAJA Performance Suite☆118Updated this week
- SST Macro Element Library☆36Updated 4 months ago
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 3 years ago
- DLA-Future☆69Updated this week
- High-performance, GPU-aware communication library☆84Updated last month
- Open source of an IBM Optimized version of the HPCG benchmark.☆14Updated 11 months ago
- Simplified Interface to Complex Memory☆27Updated last year
- sparse matrix pre-processing library☆81Updated 9 months ago
- Comb is a communication performance benchmarking tool.☆24Updated last year
- Multiple 1-stencil implementations using nvidia cuda.☆13Updated 7 years ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆39Updated last year
- Official BOLT Repository☆28Updated 6 months ago
- ☆34Updated 4 years ago
- A GPU performance prediction toolkit for CUDA programs☆16Updated 5 years ago