kjbartel / magma
Matrix Algebra on GPU and Multicore Architectures (MAGMA) source releases from http://icl.cs.utk.edu/magma/index.html
☆23 Updated 9 years ago
Alternatives and similar repositories for magma:
Users who are interested in magma are comparing it to the libraries listed below:
- Interoperability examples for OpenACC. ☆49 Updated 4 years ago
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler… ☆28 Updated 9 months ago
- ☆11 Updated 5 years ago
- A tester for BLAS libraries including OpenBLAS and Intel MKL. This project is based on ATLAS BLAS Tester. ☆34 Updated 2 years ago
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot. ☆14 Updated 5 years ago
- Next generation library for iterative sparse solvers for the ROCm platform. ☆78 Updated this week
- An alternative to Boost.MPI for a user-friendly C++ interface for MPI (MPICH). ☆19 Updated 7 years ago
- ☆17 Updated 2 weeks ago
- ☆38 Updated last month
- Compute applications. ☆24 Updated 5 years ago
- Simplified Interface to Complex Memory. ☆27 Updated last year
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037 ☆41 Updated last year
- ☆86 Updated 7 years ago
- Tensor Contraction Code Generator. ☆36 Updated 7 years ago
- A thread-safe, simple C++ wrapper for FFTW & MKL. ☆15 Updated 3 years ago
- A GPU performance prediction toolkit for CUDA programs. ☆16 Updated 6 years ago
- Visualization tool for analyzing call trees and graphs. ☆31 Updated 2 years ago
- DLA-Future ☆70 Updated this week
- The fftMPI library performs 2d/3d FFTs in parallel for grids distributed across MPI processes. ☆14 Updated 2 years ago
- Recursive LAPACK Collection. ☆42 Updated 3 years ago
- Compiler-agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA. ☆84 Updated last week
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support. ☆51 Updated 3 weeks ago
- Yaksa: High-performance Noncontiguous Data Management. ☆13 Updated 6 months ago
- PLASMA is a software package for solving problems in dense linear algebra using OpenMP. ☆27 Updated last week
- Mirror of https://gitlab.kitware.com/vtk/vtk-m ☆31 Updated last week
- An implementation of ARMCI using MPI one-sided communication (RMA). ☆14 Updated 5 months ago
- Zoltan Dynamic Load Balancing and Graph Algorithm Toolkit -- Distribution site. ☆34 Updated last year
- CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as lo… ☆30 Updated last week
- A C++-based implementation of the TeaLeaf heat conduction mini-app. This implementation of TeaLeaf replicates the functionality of the ref… ☆24 Updated 7 months ago
- Performance engineering for the rest of us. ☆30 Updated last year