kjbartel / magma
Matrix Algebra on GPU and Multicore Architectures (MAGMA) source releases from http://icl.cs.utk.edu/magma/index.html
☆22Updated 9 years ago
Alternatives and similar repositories for magma:
Users that are interested in magma are comparing it to the libraries listed below
- cuASR: CUDA Algebra for Semirings☆35Updated 2 years ago
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…☆27Updated 7 months ago
- Tensor Contraction Code Generator☆36Updated 7 years ago
- Automatic Differentiation for Tensor Algebras☆29Updated 6 years ago
- A GPU performance prediction toolkit for CUDA programs☆16Updated 5 years ago
- Recursive LAPACK Collection☆42Updated 3 years ago
- a tester for BLAS libraries including OpenBLAS and Intel MKL. This project is based on ATLAS BLAS Tester☆34Updated 2 years ago
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multiple…☆36Updated 3 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- Torch Frontend for IREE☆25Updated last year
- A domain-specific language and compiler for image processing☆76Updated 3 years ago
- MagmaDNN: a simple deep learning framework in c++☆49Updated 4 years ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆83Updated 3 weeks ago
- MLIR tools and dialect for GraphBLAS☆18Updated 2 years ago
- Parallel network flows using OpenMP and CUDA.☆27Updated 6 years ago
- An implementation of ARMCI using MPI one-sided communication (RMA)☆14Updated 4 months ago
- A unified framework across multiple programming platforms☆36Updated 8 months ago
- DLA-Future☆69Updated this week
- Data-Centric MLIR dialect☆40Updated last year
- Yaksa: High-performance Noncontiguous Data Management☆13Updated 4 months ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆38Updated last year
- Fork of magma to include more BLAS☆28Updated 8 years ago
- Next generation LAPACK implementation for ROCm platform☆98Updated this week
- Open source of an IBM Optimized version of the HPCG benchmark.☆14Updated 11 months ago
- Emulating DMA Engines on GPUs for Performance and Portability☆37Updated 9 years ago
- Compute applications.☆24Updated 5 years ago
- Simplified Interface to Complex Memory☆27Updated last year
- Next generation library for iterative sparse solvers for ROCm platform☆78Updated this week
- ☆58Updated 2 years ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆50Updated last year