starpu-runtime / starpu
This is a mirror of https://gitlab.inria.fr/starpu/starpu where our development happens, but contributions are welcome here too!
☆69, updated this week
Alternatives and similar repositories for starpu:
Users who are interested in starpu are comparing it to the libraries listed below.
- ☆46, updated this week
- RAJA Performance Suite ☆118, updated this week
- Very-Low Overhead Checkpointing System ☆57, updated 2 months ago
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core … ☆56, updated this week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA ☆85, updated 2 weeks ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs. ☆31, updated 3 months ago
- High-performance, GPU-aware communication library ☆85, updated 2 months ago
- A dynamic analysis tool to detect floating-point errors in HPC applications. ☆33, updated last week
- ROCm SPARSE marshalling library ☆67, updated this week
- TAU Performance System Public Mirror (Updated every night at midnight, USA Pacific Time) ☆43, updated this week
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte… ☆23, updated 4 months ago
- Next generation LAPACK implementation for ROCm platform ☆99, updated this week
- MPI accelerator-integrated communication extensions ☆32, updated 2 years ago
- A light-weight MPI profiler. ☆90, updated 8 months ago
- A unified framework across multiple programming platforms ☆36, updated 9 months ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy… ☆110, updated 2 months ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs ☆106, updated this week
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d… ☆106, updated 8 months ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems. ☆51, updated last month
- Distributed View Extension for Kokkos ☆45, updated 4 months ago
- ☆17, updated last year
- Reusable software components for ROCm developers ☆83, updated last week
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037 ☆41, updated last year
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH) ☆107, updated last year
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri… ☆20, updated 2 years ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools ☆121, updated 2 months ago
- DLA-Future ☆70, updated last week
- Advanced Profiling and Analytics for AMD Hardware ☆144, updated this week
- Copy-hiding array abstraction to automatically migrate data between memory spaces ☆107, updated this week
- High Performance Linpack for Next-Generation AMD HPC Accelerators ☆50, updated this week