kjbartel / magmaLinks
Matrix Algebra on GPU and Multicore Architectures (MAGMA) source releases from http://icl.cs.utk.edu/magma/index.html
☆24Updated 10 years ago
Alternatives and similar repositories for magma
Users that are interested in magma are comparing it to the libraries listed below
Sorting:
- Tensor Contraction Code Generator☆39Updated 8 years ago
- ☆87Updated 8 years ago
- Automatic Differentiation for Tensor Algebras☆28Updated 7 years ago
- Strassen's Algorithm for Tensor Contraction☆13Updated 8 years ago
- ☆64Updated 2 weeks ago
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…☆31Updated last year
- Compute applications.☆25Updated 6 years ago
- Fast matrix multiplication☆31Updated 4 years ago
- DLA-Future☆81Updated last month
- Subset of BLAS routines optimized for NVIDIA GPUs☆74Updated 2 years ago
- Fork of magma to include more BLAS☆28Updated 9 years ago
- CUDA Dynamic Memory Allocator for SOA Data Layout☆38Updated 3 years ago
- Next generation library for iterative sparse solvers for ROCm platform☆90Updated 2 weeks ago
- A GPU performance prediction toolkit for CUDA programs☆18Updated 6 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆21Updated 8 years ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆80Updated 4 months ago
- MagmaDNN: a simple deep learning framework in c++☆51Updated 5 years ago
- ☆74Updated 2 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated last year
- Distributed-parallel C/C++ Tensor Library☆20Updated 3 months ago
- a heterogeneous multiGPU level-3 BLAS library☆46Updated 6 years ago
- Vector Math Library☆84Updated last month
- Introduction to CUDA programming☆129Updated 8 years ago
- Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020.☆22Updated 5 years ago
- This repository is the summary of all of our works for the XLA.☆11Updated 7 years ago
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API☆91Updated 2 years ago
- A task benchmark☆44Updated last year
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆95Updated 3 weeks ago
- C++ Header-Only Library for High-Performance Tensor-Vector Multiplication☆23Updated last month
- A C/C++ task-based programming model for shared memory and distributed parallel computing.☆72Updated 5 years ago