xianyi / BLAS-Tester
a tester for BLAS libraries including OpenBLAS and Intel MKL. This project is based on ATLAS BLAS Tester
☆34Updated 2 years ago
Alternatives and similar repositories for BLAS-Tester:
Users that are interested in BLAS-Tester are comparing it to the libraries listed below
- sparse matrix pre-processing library☆82Updated 10 months ago
- Tensor Contraction Code Generator☆36Updated 7 years ago
- Recursive LAPACK Collection☆42Updated 3 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 5 years ago
- OpenSHMEM Application Programming Interface☆54Updated 4 months ago
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures☆49Updated last year
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆106Updated 7 months ago
- Experimental Linear Algebra Performance Studies☆12Updated 8 years ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆106Updated last week
- Fork of magma to include more BLAS☆28Updated 8 years ago
- Autonomic Performance Environment for eXascale (APEX)☆44Updated this week
- Loop Kernel Analysis and Performance Modeling Toolkit☆92Updated this week
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- Vector Math Library☆78Updated 8 years ago
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆77Updated 2 weeks ago
- An implementation of ARMCI using MPI one-sided communication (RMA)☆14Updated 5 months ago
- Next generation library for iterative sparse solvers for ROCm platform☆78Updated this week
- CUDA and OpenMP implementations of C2R/R2C inplace transposition☆46Updated 10 years ago
- A Monte Carlo transport mini-app for studying new parallel algorithms☆17Updated 2 months ago
- Compute applications.☆24Updated 5 years ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆84Updated last week
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆49Updated 6 months ago
- LAPACK++ is a C++ wrapper around CPU and GPU LAPACK and LAPACK-like linear algebra libraries, developed as part of the SLATE project.☆62Updated 2 months ago
- [deprecated] Reference Implementation of OpenSHMEM on GASNet (specification <= 1.3)☆43Updated 7 years ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆40Updated last year
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆110Updated 2 months ago
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction☆66Updated this week
- Fast matrix multiplication☆29Updated 3 years ago
- MPI wrapper generator, for writing PMPI tool libraries☆34Updated 2 years ago
- Implementation of MPI that supports large counts☆48Updated 3 months ago