cslab-ntua / sparsex
The SparseX sparse kernel optimization library
☆40 Updated 6 years ago
Alternatives and similar repositories for sparsex:
Users who are interested in sparsex are comparing it to the libraries listed below:
- sparse matrix pre-processing library ☆82 Updated 10 months ago
- Barcelona OpenMP Task Suite is a collection of applications that allow testing OpenMP tasking implementations and comparing their behaviour u… ☆45 Updated 5 years ago
- Chai ☆43 Updated last year
- Loop Kernel Analysis and Performance Modeling Toolkit ☆92 Updated this week
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward ☆23 Updated 6 years ago
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte… ☆23 Updated 4 months ago
- Comb is a communication performance benchmarking tool. ☆24 Updated 2 years ago
- Global Memory and Threading runtime system ☆23 Updated 10 months ago
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri… ☆20 Updated 2 years ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037 ☆40 Updated last year
- Tensor Contraction Code Generator ☆36 Updated 7 years ago
- Parallel Tensor Infrastructure (ParTI!) ☆28 Updated 4 years ago
- MPI wrapper generator, for writing PMPI tool libraries ☆34 Updated 2 years ago
- XSBench: The Monte Carlo Macroscopic Cross Section Lookup Benchmark ☆78 Updated last year
- A task benchmark ☆41 Updated 7 months ago
- Parallelized and vectorized SpMV on Intel Xeon Phi (Knights Landing, AVX512, KNL) ☆25 Updated last year
- ☆43 Updated 4 years ago
- Nanos++ is a runtime designed to serve as runtime support in parallel environments. It is mainly used to support OmpSs, an extension to O… ☆38 Updated 3 years ago
- This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Trian… ☆26 Updated 4 years ago
- Instantiate the Cache Aware Roofline Model on single socket and multisocket systems. ☆27 Updated 6 years ago
- Simplified Interface to Complex Memory ☆27 Updated last year
- HiCMA: Hierarchical Computations on Manycore Architectures ☆30 Updated 2 years ago
- A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels ☆18 Updated 9 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs ☆34 Updated 5 years ago
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multiple… ☆36 Updated 3 years ago
- A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves (SpTRSV) ☆21 Updated 5 years ago
- Compute applications. ☆24 Updated 5 years ago
- bhSPARSE: A Sparse BLAS Library ☆16 Updated 9 years ago
- CSR-based SpMV on Heterogeneous Processors (Intel Broadwell, AMD Kaveri and nVidia Tegra K1) ☆27 Updated 9 years ago
- A Benchmark Suite for Heterogeneous System Computation ☆53 Updated last month