cslab-ntua / sparsex
The SparseX sparse kernel optimization library
☆40 · Updated 6 years ago
Alternatives and similar repositories for sparsex:
Users who are interested in sparsex are comparing it to the libraries listed below.
- sparse matrix pre-processing library ☆81 · Updated 11 months ago
- Parallelized and vectorized SpMV on Intel Xeon Phi (Knights Landing, AVX512, KNL) ☆25 · Updated last year
- HiCMA: Hierarchical Computations on Manycore Architectures ☆30 · Updated 2 years ago
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte… ☆24 · Updated 5 months ago
- Comb is a communication performance benchmarking tool. ☆24 · Updated 2 years ago
- CSR-based SpMV on Heterogeneous Processors (Intel Broadwell, AMD Kaveri and nVidia Tegra K1) ☆27 · Updated 9 years ago
- Global Memory and Threading runtime system ☆23 · Updated 11 months ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037 ☆41 · Updated last year
- A task benchmark ☆41 · Updated 8 months ago
- This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Trian… ☆26 · Updated 4 years ago
- CSR-based SpGEMM on nVidia and AMD GPUs ☆45 · Updated 9 years ago
- Loop Kernel Analysis and Performance Modeling Toolkit ☆93 · Updated 3 weeks ago
- LonestarGPU: Irregular algorithms parallelized for GPUs ☆34 · Updated 5 years ago
- Instantiate the Cache Aware Roofline Model on single socket and multisocket systems. ☆27 · Updated 6 years ago
- ☆43 · Updated 4 years ago
- A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV) ☆21 · Updated 5 years ago
- spGPU library for sparse linear algebra on GPUs ☆9 · Updated 2 years ago
- Tensor Contraction Code Generator ☆37 · Updated 7 years ago
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward ☆24 · Updated 6 years ago
- ☆17 · Updated last year
- Experimental Linear Algebra Performance Studies ☆12 · Updated 8 years ago
- MPI wrapper generator, for writing PMPI tool libraries ☆34 · Updated 3 weeks ago
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri… ☆20 · Updated 2 years ago
- Automatically exported from code.google.com/p/patus ☆15 · Updated 9 years ago
- bhSPARSE: A Sparse BLAS Library ☆16 · Updated 9 years ago
- A Multi-purpose, Application-Centric, Scalable I/O Proxy Application ☆34 · Updated 4 years ago
- Compute applications. ☆24 · Updated 5 years ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition ☆46 · Updated 10 years ago
- Oak Ridge OpenSHMEM Benchmarks ☆15 · Updated 6 years ago
- ☆10 · Updated 3 weeks ago