starpu-runtime / starpu
This is a mirror of https://gitlab.inria.fr/starpu/starpu where our development happens, but contributions are welcome here too!
☆61Updated this week
Related projects: ⓘ
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆84Updated 2 months ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data☆27Updated 3 weeks ago
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated last year
- Kokkos Remote Spaces implements distributed Kokkos Views and related APIs for distributed parallel programming.☆42Updated 2 weeks ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆109Updated 2 weeks ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆21Updated last week
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core …☆48Updated this week
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆103Updated last month
- RAJA Performance Suite☆110Updated last week
- TAU Performance System Public Mirror (Updated every night at midnight, USA Pacific Time)☆38Updated this week
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte…☆23Updated 4 months ago
- TTG: Template Task Graph C++ API☆18Updated last month
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆104Updated last week
- A unified framework across multiple programming platforms☆28Updated 3 months ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆78Updated last month
- Comb is a communication performance benchmarking tool.☆23Updated last year
- Instrumentation framework to generate execution traces of the most used parallel runtimes.☆60Updated last week
- Very-Low Overhead Checkpointing System☆52Updated 3 months ago
- A light-weight MPI profiler.☆77Updated last month
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆97Updated last year
- ☆40Updated this week
- Next generation library for iterative sparse solvers for ROCm platform☆74Updated this week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆34Updated 8 months ago
- The SCMC and PSCMC programming language☆17Updated last year
- MPI accelerator-integrated communication extensions☆33Updated last year
- Portable HPC Containers (C++)☆47Updated 2 weeks ago
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆52Updated last week
- Implementation of a cool communication layer☆14Updated 3 weeks ago
- High-performance, GPU-aware communication library☆85Updated last month
- PMIx Reference RunTime Environment (PRRTE)☆35Updated this week