ParRes / Kernels
This is a set of simple programs that can be used to explore the features of a parallel platform.
☆418Updated last month
Alternatives and similar repositories for Kernels:
Users that are interested in Kernels are comparing it to the libraries listed below
- HPCToolkit performance tools: measurement and analysis components☆338Updated 2 weeks ago
- RAJA Performance Portability Layer (C++)☆502Updated this week
- Information about many aspects of high-performance computing. Wiki content moved to ~/docs.☆280Updated last week
- Caliper is an instrumentation and performance profiling library☆360Updated last week
- STREAM, for lots of devices written in many programming models☆327Updated 5 months ago
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels☆322Updated this week
- RAJA Performance Suite☆118Updated this week
- A streamlined CMake build system foundation for developing HPC software☆266Updated last week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆105Updated last year
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆106Updated 6 months ago
- Loop Kernel Analysis and Performance Modeling Toolkit☆91Updated 4 months ago
- Run a parallel command inside a split tmux window☆142Updated 2 years ago
- A light-weight MPI profiler.☆86Updated 6 months ago
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆262Updated 2 weeks ago
- The Charm++ parallel programming system. Visit https://charmplusplus.org/ for more information.☆209Updated this week
- Portable and vendor neutral framework for parallel programming on heterogeneous platforms.☆407Updated 2 months ago
- Tutorials for the Kokkos C++ Performance Portability Programming Ecosystem☆309Updated this week
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆198Updated last month
- Data parallel C++ mathematical object library☆158Updated last week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆116Updated last week
- A massively-parallel, block-sparse tensor framework written in C++☆267Updated last week
- Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template…☆358Updated 5 months ago
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas…☆211Updated this week
- ☆156Updated 2 weeks ago
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability…☆101Updated 2 months ago
- Advanced Profiling and Analytics for AMD Hardware☆139Updated this week
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆56Updated last week
- Partitioned Global Address Space (PGAS) library for distributed arrays☆102Updated this week
- Abstraction Library for Parallel Kernel Acceleration☆362Updated this week
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core …☆52Updated 3 weeks ago