kjbartel / magmaLinks
Matrix Algebra on GPU and Multicore Architectures (MAGMA) source releases from http://icl.cs.utk.edu/magma/index.html
☆24Updated 10 years ago
Alternatives and similar repositories for magma
Users that are interested in magma are comparing it to the libraries listed below
Sorting:
- Tensor Contraction Code Generator☆39Updated 8 years ago
- Next generation library for iterative sparse solvers for ROCm platform☆89Updated this week
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆79Updated 3 months ago
- Python wrapper for isl, an integer set library☆80Updated this week
- A domain-specific language and compiler for image processing☆77Updated 4 years ago
- MagmaDNN: a simple deep learning framework in c++☆51Updated 5 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆21Updated 8 years ago
- A GPU performance prediction toolkit for CUDA programs☆18Updated 6 years ago
- DLA-Future☆80Updated last week
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…☆31Updated last year
- A dynamic analysis tool to detect floating-point errors in HPC applications.☆39Updated this week
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte…☆25Updated 5 months ago
- Par4All is an automatic parallelizing and optimizing compiler (workbench) for C and Fortran sequential programs☆53Updated 10 years ago
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures☆56Updated 2 years ago
- CUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.☆59Updated 3 years ago
- Zoltan Dynamic Load Balancing and Graph Algorithm Toolkit -- Distribution site☆40Updated 2 years ago
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆93Updated last week
- ☆63Updated this week
- Compute applications.☆25Updated 5 years ago
- Recursive LAPACK Collection☆44Updated 3 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆115Updated last week
- A unified framework across multiple programming platforms☆42Updated 6 months ago
- Interoperability examples for OpenACC.☆48Updated 5 years ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆74Updated 2 years ago
- This repository is the summary of all of our works for the XLA.☆11Updated 7 years ago
- ☆87Updated 8 years ago
- TTG: Template Task Graph C++ API☆26Updated this week
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆60Updated 2 weeks ago
- CUDA Dynamic Memory Allocator for SOA Data Layout☆38Updated 3 years ago
- A library for code transformations with guaranteed legality☆18Updated this week