xianyi / BLAS-Tester
a tester for BLAS libraries including OpenBLAS and Intel MKL. This project is based on ATLAS BLAS Tester
☆34Updated last year
Alternatives and similar repositories for BLAS-Tester:
Users that are interested in BLAS-Tester are comparing it to the libraries listed below
- sparse matrix pre-processing library☆81Updated 8 months ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 5 years ago
- Experimental Linear Algebra Performance Studies☆12Updated 7 years ago
- Tensor Contraction Code Generator☆36Updated 7 years ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition☆46Updated 9 years ago
- Autonomic Performance Environment for eXascale (APEX)☆42Updated this week
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆106Updated 5 months ago
- OpenSHMEM Application Programming Interface☆51Updated 2 months ago
- Compute applications.☆24Updated 5 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆104Updated last year
- MPI wrapper generator, for writing PMPI tool libraries☆34Updated 2 years ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆106Updated this week
- Next generation library for iterative sparse solvers for ROCm platform☆79Updated this week
- HiCMA: Hierarchical Computations on Manycore Architectures☆30Updated last year
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆82Updated this week
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures☆49Updated last year
- SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability…☆101Updated last month
- Recursive LAPACK Collection☆42Updated 2 years ago
- An implementation of ARMCI using MPI one-sided communication (RMA)☆14Updated 3 months ago
- Contains sources related to the lectures and labs for the NVIDIA OpenACC course.☆52Updated 5 years ago
- High-performance, GPU-aware communication library☆84Updated last week
- ☆86Updated 7 years ago
- RAJA Performance Suite☆117Updated this week
- mirror from http://lotsofcores.com book 2, since dropbox isn't good for everyone☆38Updated 8 years ago
- GPU implementation of classical molecular dynamics proxy application.☆31Updated 7 years ago
- Fork of magma to include more BLAS☆28Updated 8 years ago
- Next generation LAPACK implementation for ROCm platform☆97Updated this week
- The SparseX sparse kernel optimization library☆39Updated 6 years ago
- Vector Math Library☆76Updated 8 years ago