starpu-runtime / starpu
This is a mirror of https://gitlab.inria.fr/starpu/starpu where our development happens, but contributions are welcome here too!
☆68 Updated this week
Alternatives and similar repositories for starpu:
Users who are interested in starpu compare it to the libraries listed below.
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy… ☆104 Updated 2 weeks ago
- RAJA Performance Suite ☆118 Updated last week
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core … ☆52 Updated 3 weeks ago
- Advanced Profiling and Analytics for AMD Hardware ☆139 Updated this week
- Copy-hiding array abstraction to automatically migrate data between memory spaces ☆106 Updated this week
- TAU Performance System Public Mirror (Updated every night at midnight, USA Pacific Time) ☆39 Updated this week
- Next generation library for iterative sparse solvers for ROCm platform ☆79 Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools ☆116 Updated last week
- Autonomic Performance Environment for eXascale (APEX) ☆42 Updated 2 weeks ago
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler… ☆27 Updated 7 months ago
- ☆11 Updated last week
- ROCm Thrust - run Thrust dependent software on AMD GPUs ☆104 Updated this week
- A unified framework across multiple programming platforms ☆35 Updated 7 months ago
- Next generation LAPACK implementation for ROCm platform ☆98 Updated this week
- Partitioned Global Address Space (PGAS) library for distributed arrays ☆102 Updated this week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA ☆83 Updated this week
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric … ☆63 Updated last week
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d… ☆106 Updated 6 months ago
- Distributed View Extension for Kokkos ☆43 Updated last month
- High Performance Linpack for Next-Generation AMD HPC Accelerators ☆45 Updated last week
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri… ☆25 Updated this week
- Reusable software components for ROCm developers ☆81 Updated this week
- Distributed ranges is a generalization of C++ ranges for distributed data structures. ☆48 Updated this week
- DLA-Future ☆69 Updated this week
- High-performance, GPU-aware communication library ☆84 Updated 3 weeks ago
- AMD’s C++ library for accelerating tensor primitives ☆38 Updated this week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development. ☆35 Updated 4 months ago
- Portable HPC Containers (C++) ☆48 Updated this week
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems. ☆46 Updated 3 months ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆198 Updated last month