alpaka-group / alpakaLinks

Abstraction Library for Parallel Kernel Acceleration

☆384

Alternatives and similar repositories for alpaka

Users that are interested in alpaka are comparing it to the libraries listed below

Sorting:

LLNL / RAJA
RAJA Performance Portability Layer (C++)
☆524Updated this week
kokkos / kokkos-kernels
Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels
☆342Updated last week
LLNL / blt
A streamlined CMake build system foundation for developing HPC software
☆268Updated 2 weeks ago
NERSC / timemory
Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template…
☆360Updated 10 months ago
LLNL / Caliper
Caliper is an instrumentation and performance profiling library
☆378Updated this week
rabauke / mpl
A C++17 message passing library based on MPI
☆171Updated 3 weeks ago
ValeevGroup / tiledarray
A massively-parallel, block-sparse tensor framework written in C++
☆294Updated this week
kokkos / kokkos-tools
Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools
☆130Updated last week
libocca / occa
Portable and vendor neutral framework for parallel programming on heterogeneous platforms.
☆422Updated last week
codeplaysoftware / portBLAS
Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.
☆261Updated 5 months ago
kokkos / kokkos-tutorials
Tutorials for the Kokkos C++ Performance Portability Programming Ecosystem
☆330Updated 2 weeks ago
charmplusplus / charm
The Charm++ parallel programming system. Visit https://charmplusplus.org/ for more information.
☆216Updated last week
UoB-HPC / BabelStream
STREAM, for lots of devices written in many programming models
☆343Updated 10 months ago
kokkos / mdspan
Reference implementation of mdspan targeting C++23
☆469Updated last week
arborx / ArborX
Performance-portable geometric search library
☆207Updated last week
celerity / celerity-runtime
High-level C++ for Accelerator Clusters
☆145Updated last week
ParRes / Kernels
This is a set of simple programs that can be used to explore the features of a parallel platform.
☆433Updated 3 weeks ago
eyalroz / cuda-kat
CUDA kernel author's tools
☆111Updated 3 years ago
uxlfoundation / oneMath
oneAPI Math Library (oneMath)
☆690Updated this week
NVIDIA / jitify
A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).
☆545Updated last week
romeric / Fastor
A lightweight high performance tensor algebra framework for modern C++
☆791Updated last year
agenium-scale / nsimd
Agenium Scale vectorization library for CPUs and GPUs
☆333Updated 3 years ago
dash-project / dash
DASH, the C++ Template Library for Distributed Data Structures with Support for Hierarchical Locality for HPC and Data-Driven Science
☆159Updated 3 years ago
eyalroz / cuda-api-wrappers
Thin, unified, C++-flavored wrappers for the CUDA APIs
☆844Updated last week
kokkos / stdBLAS
Reference Implementation for stdBLAS
☆143Updated last month
KhronosGroup / SyclParallelSTL
Open Source Parallel STL implementation
☆528Updated last year
cusplibrary / cusplibrary
CUSP : A C++ Templated Sparse Matrix Library
☆413Updated 2 weeks ago
Alpine-DAV / ascent
A flyweight in situ visualization and analysis runtime for multi-physics HPC simulations
☆210Updated this week
LLNL / RAJAPerf
RAJA Performance Suite
☆117Updated this week
ROCm / aomp
AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas…
☆224Updated last week