kokkos / stdBLAS
Reference Implementation for stdBLAS
☆151 · Updated last week
Alternatives and similar repositories for stdBLAS
Users who are interested in stdBLAS are comparing it to the libraries listed below.
- Reference implementation of mdspan targeting C++23 ☆486 · Updated last week
- A C++17 message passing library based on MPI ☆179 · Updated 2 months ago
- C++20 and onward collection of high performance data containers and related tools ☆57 · Updated 2 weeks ago
- Distributed ranges is a generalization of C++ ranges for distributed data structures. ☆52 · Updated 3 months ago
- pika is a C++ tasking library built on std::execution with fibers, CUDA, HIP, and MPI support. ☆79 · Updated last week
- Improve the usage experience of std::simd (Parallelism TS 2) ☆30 · Updated 4 months ago
- A C++17 interface for HDF5 ☆98 · Updated last month
- Various documents and code related to proposals for WG21 ☆67 · Updated last year
- A Low-Level Abstraction of Memory Access ☆92 · Updated last year
- Performance-portable geometric search library ☆219 · Updated last week
- Boost.uBlas ☆119 · Updated 2 weeks ago
- Abstraction Library for Parallel Kernel Acceleration ☆399 · Updated this week
- Caliper is an instrumentation and performance profiling library ☆395 · Updated this week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA ☆95 · Updated 3 weeks ago
- A highly optimised C++ library for mathematical applications and neural networks. ☆177 · Updated 4 months ago
- Header-only C++20 wrapper for MPI 4.0. ☆47 · Updated 2 years ago
- A small lightweight std::execution work-alike ☆65 · Updated 8 months ago
- Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template… ☆366 · Updated last year
- Copy-hiding array abstraction to automatically migrate data between memory spaces ☆109 · Updated last week
- A streamlined CMake build system foundation for developing HPC software ☆280 · Updated last week
- Measures high-level timing and memory usage metrics during compilation ☆76 · Updated 4 years ago
- An OpenMP runtime implemented using HPX ☆24 · Updated 3 years ago
- ☆14 · Updated last year
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels ☆369 · Updated last week
- General-purpose C++ graph library ☆232 · Updated 4 months ago
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project. ☆91 · Updated 2 months ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools ☆136 · Updated 2 weeks ago
- C++ HPC Math Library ☆45 · Updated 6 years ago
- A fully featured single header library implementing a vector container with a small buffer optimization. ☆71 · Updated 7 months ago
- CS infrastructure components for HPC applications ☆180 · Updated this week