SparseBLAS / spblas-reference
☆12Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for spblas-reference
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core …☆50Updated this week
- Training examples for SYCL☆38Updated last week
- ☆17Updated 10 months ago
- This aims to be an wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆21Updated last month
- CPE change log and release notes☆26Updated 2 months ago
- Molecular dynamics proxy application based on Kokkos☆31Updated 4 months ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 2 months ago
- Highly Efficient FFT for Exascale☆35Updated 6 months ago
- Comb is a communication performance benchmarking tool.☆24Updated last year
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 3 years ago
- Tools to run and parse MKL verbose mode☆17Updated 2 years ago
- QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experi…☆27Updated 3 months ago
- ☆10Updated 3 months ago
- Distributed View Extension for Kokkos☆43Updated 2 months ago
- ☆14Updated last week
- Experimental MPI Wrapper for Kokkos☆16Updated 2 weeks ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆39Updated 9 months ago
- Error-Free Transformations as building blocks for compensated algorithms☆14Updated last year
- ☆52Updated last week
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆93Updated 3 weeks ago
- The Kokkos Fortran Interop repository contains tools and interfaces which help interactions between Fortran portions of an applications a…☆31Updated 2 weeks ago
- OpenMP vs Offload☆21Updated last year
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆54Updated last week
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆48Updated 3 months ago
- High-performance, GPU-aware communication library☆84Updated last month
- library for measuring communication in distributed-memory parallel applications that use the standard Message-Passing Interface (MPI)☆19Updated 7 months ago
- Run a parallel command inside a split tmux window☆136Updated 2 years ago
- Intermediate MPI lesson☆26Updated last year
- Implementation of a cool communication layer☆15Updated last week
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆29Updated 2 months ago