xianyi / BLAS-Tester
A tester for BLAS libraries, including OpenBLAS and Intel MKL. This project is based on the ATLAS BLAS Tester.
☆33 · Updated last year
Related projects:
- sparse matrix pre-processing library ☆81 · Updated 4 months ago
- Experimental Linear Algebra Performance Studies ☆12 · Updated 7 years ago
- Tensor Contraction Code Generator ☆36 · Updated 7 years ago
- Autonomic Performance Environment for eXascale (APEX) ☆38 · Updated this week
- Compute applications. ☆25 · Updated 4 years ago
- a heterogeneous multiGPU level-3 BLAS library ☆45 · Updated 4 years ago
- OpenSHMEM Application Programming Interface ☆51 · Updated 3 weeks ago
- Recursive LAPACK Collection ☆42 · Updated 2 years ago
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures ☆47 · Updated last year
- Interoperability examples for OpenACC. ☆48 · Updated 3 years ago
- Next generation library for iterative sparse solvers for ROCm platform ☆74 · Updated this week
- [deprecated] Reference Implementation of OpenSHMEM on GASNet (specification <= 1.3) ☆43 · Updated 7 years ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition ☆44 · Updated 9 years ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda ☆78 · Updated last month
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d… ☆103 · Updated last month
- PLASMA is a software package for solving problems in dense linear algebra using OpenMP ☆24 · Updated last month
- Fork of magma to include more BLAS ☆28 · Updated 7 years ago
- TTC: A high-performance Compiler for Tensor Transpositions ☆20 · Updated 6 years ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces ☆104 · Updated last week
- A library for C++/Fortran computer simulations (e.g. stencil codes, mesh-free, unstructured grids, n-body & particle methods). Scales fro… ☆38 · Updated 3 years ago
- A mirror of FleCSI's internal gitlab repository. ☆67 · Updated 3 years ago
- Contains sources related to the lectures and labs for the NVIDIA OpenACC course. ☆52 · Updated 4 years ago
- A mirror of cinch's internal gitlab repository. ☆22 · Updated last year
- ☆82 · Updated 7 years ago
- Implementation of AMD HIP for CPUs ☆22 · Updated 4 years ago
- ☆19 · Updated this week
- High-performance object-based library for DLA computations ☆38 · Updated 6 months ago
- RAJA Performance Suite ☆110 · Updated last week
- MPI wrapper generator, for writing PMPI tool libraries ☆34 · Updated last year
- mirror from http://lotsofcores.com book 2, since dropbox isn't good for everyone ☆38 · Updated 8 years ago