kjbartel / magma
Matrix Algebra on GPU and Multicore Architectures (MAGMA) source releases from http://icl.cs.utk.edu/magma/index.html
☆21 Updated 9 years ago
Related projects
Alternatives and complementary repositories for magma
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler… ☆27 Updated 4 months ago
- Next generation library for iterative sparse solvers for ROCm platform ☆76 Updated this week
- cuASR: CUDA Algebra for Semirings ☆34 Updated 2 years ago
- Tensor Contraction Code Generator ☆36 Updated 7 years ago
- C++ Header-Only Library for High-Performance Tensor-Vector Multiplication ☆19 Updated this week
- A tester for BLAS libraries including OpenBLAS and Intel MKL. This project is based on ATLAS BLAS Tester ☆33 Updated last year
- MagmaDNN: a simple deep learning framework in C++ ☆45 Updated 4 years ago
- SuiteSparse: a suite of sparse matrix packages by @DrTimothyAldenDavis et al. with native CMake support ☆52 Updated 4 months ago
- TTC: A high-performance Compiler for Tensor Transpositions ☆20 Updated 7 years ago
- Data Dependence Analyzer in the Polyhedral Model ☆19 Updated last year
- ☆27 Updated 3 weeks ago
- Recursive LAPACK Collection ☆42 Updated 2 years ago
- Uses Tensor Cores to compute back-to-back HGEMM (half-precision general matrix multiplication) with the MMA PTX instruction. ☆11 Updated last year
- Experimental Linear Algebra Performance Studies ☆12 Updated 7 years ago
- Next generation LAPACK implementation for ROCm platform ☆95 Updated this week
- StarPU Runtime system ☆16 Updated 14 years ago
- Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line ☆18 Updated this week
- Repository holding the code base for AC-SpGEMM: "Adaptive Sparse Matrix-Matrix Multiplication on the GPU" ☆28 Updated 4 years ago
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project. ☆66 Updated 3 weeks ago
- ☆55 Updated last year
- High-performance Geometric Multigrid ☆33 Updated 5 years ago
- Sympiler is a Code Generator for Transforming Sparse Matrix Codes ☆42 Updated last year
- Fork of magma to include more BLAS ☆28 Updated 8 years ago
- PLASMA is a software package for solving problems in dense linear algebra using OpenMP ☆25 Updated 3 months ago
- A hierarchical matrix C/C++ library ☆22 Updated this week
- CUDA Template Functions ☆18 Updated 3 months ago
- Distributed Communication-Optimal LU-factorization Algorithm ☆12 Updated 3 years ago
- A thread safe simple C++ wrapper for FFTW & MKL ☆15 Updated 3 years ago
- A GPU performance prediction toolkit for CUDA programs ☆16 Updated 5 years ago
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures ☆48 Updated last year