IntelLabs / SpMP

sparse matrix pre-processing library

☆83

Alternatives and similar repositories for SpMP

Users who are interested in SpMP are comparing it to the libraries listed below.

cslab-ntua / sparsex
The SparseX sparse kernel optimization library
☆43 Updated 6 years ago
bryancatanzaro / inplace
CUDA and OpenMP implementations of C2R/R2C inplace transposition
☆48 Updated 10 years ago
linnanwang / BLASX
a heterogeneous multiGPU level-3 BLAS library
☆46 Updated 6 years ago
michael-lehn / ulmBLAS
ulmBLAS
☆108 Updated 6 months ago
cusplibrary / cusplibrary
CUSP : A C++ Templated Sparse Matrix Library
☆419 Updated 4 months ago
intel / yask
YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…
☆110 Updated 5 months ago
bryancatanzaro / trove
Full-speed Array of Structures access
☆176 Updated 2 years ago
xianyi / BLAS-Tester
A tester for BLAS libraries, including OpenBLAS and Intel MKL. This project is based on the ATLAS BLAS Tester.
☆36 Updated 2 years ago
sympiler / sympiler
Sympiler is a Code Generator for Transforming Sparse Matrix Codes
☆44 Updated 2 years ago
LLNL / Aluminum
High-performance, GPU-aware communication library
☆86 Updated 11 months ago
RRZE-HPC / kerncraft
Loop Kernel Analysis and Performance Modeling Toolkit
☆96 Updated 8 months ago
ecrc / kblas-gpu
Subset of BLAS routines optimized for NVIDIA GPUs
☆74 Updated 2 years ago
hpcgarage / ParTI
Parallel Tensor Infrastructure (ParTI!)
☆31 Updated 5 years ago
ShadenSmith / splatt
The Surprisingly ParalleL spArse Tensor Toolkit.
☆73 Updated 3 years ago
EBD-CREST / nsparse
Sparse matrix computation library for GPU
☆58 Updated 5 years ago
UO-OACISS / apex
Autonomic Performance Environment for eXascale (APEX)
☆49 Updated 5 months ago
ecrc / hicma
HiCMA: Hierarchical Computations on Manycore Architectures
☆33 Updated 2 years ago
ap-hynninen / cutt
CUDA Tensor Transpose (cuTT) library
☆53 Updated 8 years ago
HPAC / TTC
TTC: A high-performance Compiler for Tensor Transpositions
☆21 Updated 8 years ago
springer13 / hptt
High-Performance Tensor Transpose library
☆204 Updated 2 years ago
arbenson / fast-matmul
Fast matrix multiplication
☆31 Updated 4 years ago
LLNL / RAJAPerf
RAJA Performance Suite
☆125 Updated this week
pssrawat / artemis
GPU code optimizer for stencil computations. Refer to our IPDPS'19 paper for more details.
☆24 Updated 6 years ago
SparseBLAS / spblas-reference
☆34 Updated 2 months ago
PASSIONLab / CombBLAS
The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …
☆79 Updated 4 months ago
weifengliu-ssslab / Benchmark_SpMV_using_CSR5
CSR5-based SpMV on CPUs, GPUs and Xeon Phi
☆108 Updated last year
weifengliu-ssslab / Benchmark_SpTRSV_using_CSC
A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV)
☆22 Updated 5 years ago
HiPerCoRe / KTT
Kernel Tuning Toolkit
☆64 Updated last month
StanfordLegion / task-bench
A task benchmark
☆45 Updated last year
ChenhanYu / hmlp
High-Performance Machine Learning Primitives
☆12 Updated 4 years ago