cslab-ntua / sparsexLinks
The SparseX sparse kernel optimization library
☆39Updated 6 years ago
Alternatives and similar repositories for sparsex
Users that are interested in sparsex are comparing it to the libraries listed below
Sorting:
- sparse matrix pre-processing library☆82Updated last year
- MPI wrapper generator, for writing PMPI tool libraries☆34Updated 2 months ago
- HiCMA: Hierarchical Computations on Manycore Architectures☆30Updated 2 years ago
- A task benchmark☆42Updated 9 months ago
- Tensor Contraction Code Generator☆37Updated 7 years ago
- Oak Ridge OpenSHMEM Benchmarks☆15Updated 6 years ago
- This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Trian…☆26Updated 5 years ago
- Parallelized and vectorized SpMV on Intel Xeon Phi (Knights Landing, AVX512, KNL)☆24Updated last year
- Comb is a communication performance benchmarking tool.☆25Updated 2 years ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆41Updated last year
- Loop Kernel Analysis and Performance Modeling Toolkit☆93Updated 2 months ago
- Simplified Interface to Complex Memory☆28Updated last year
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆75Updated this week
- Nanos++ is a runtime designed to serve as runtime support in parallel environments. It is mainly used to support OmpSs, a extension to O…☆37Updated 3 years ago
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte…☆24Updated 6 months ago
- Instanciate the Cache Aware Roofline Model on single socket and multisocket systems.☆27Updated 6 years ago
- Compute applications.☆24Updated 5 years ago
- ☆29Updated last week
- Global Memory and Threading runtime system☆23Updated last year
- CUDA and OpenMP implementations of C2R/R2C inplace transposition☆46Updated 10 years ago
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated 2 years ago
- A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV)☆22Updated 5 years ago
- Autonomic Performance Environment for eXascale (APEX)☆48Updated 2 weeks ago
- Logger for MPI communication☆27Updated last year
- tools to create performance and roofline plots from measured data☆58Updated 10 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs☆35Updated 5 years ago
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆70Updated last month
- CSR-based SpGEMM on nVidia and AMD GPUs☆46Updated 9 years ago
- ☆44Updated 4 years ago
- [deprecated] Reference Implementation of OpenSHMEM on GASNet (specification <= 1.3)☆43Updated 7 years ago