flame / tblis-strassenLinks
Strassen's Algorithm for Tensor Contraction
☆12Updated 7 years ago
Alternatives and similar repositories for tblis-strassen
Users that are interested in tblis-strassen are comparing it to the libraries listed below
Sorting:
- QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experi…☆27Updated 11 months ago
- Tensor Contraction Code Generator☆37Updated 7 years ago
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri…☆27Updated this week
- C++ library for tensor computations☆35Updated 2 years ago
- HiCMA: Hierarchical Computations on Manycore Architectures☆30Updated 2 years ago
- A BUDE virtual-screening benchmark, in many programming models☆29Updated 8 months ago
- Tensor Algebra Library Routines for Shared Memory Systems☆38Updated last year
- DLA-Future☆75Updated last month
- Communication Avoiding Numerical Dense Matrix Computations☆11Updated 4 years ago
- Basic Tensor Algebra Subroutines☆48Updated 2 weeks ago
- An implementation of ARMCI using MPI one-sided communication (RMA)☆14Updated 8 months ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆41Updated last year
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆77Updated 3 weeks ago
- Partitioned Global Address Space (PGAS) library for distributed arrays☆105Updated 2 weeks ago
- A place to store information for the tensor discussions and possible specifications.☆17Updated 3 months ago
- Reference implementation of the draft C++ GraphBLAS specification.☆33Updated 4 months ago
- A scalable eigensolver for dense, symmetric (hermitian) matrices (fork of https://gitlab.mpcdf.mpg.de/elpa/elpa.git)☆30Updated 4 months ago
- Omni Compiler for C and Fortran programs with XcalableMP and OpenACC directives☆61Updated last year
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction☆68Updated 3 months ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆54Updated 3 months ago
- sparse matrix pre-processing library☆82Updated last year
- Apollo: Online Machine Learning for Performance Portability☆23Updated 10 months ago
- cuASR: CUDA Algebra for Semirings☆36Updated 2 years ago
- ☆34Updated 5 years ago
- Performance engineering for the rest of us.☆31Updated 2 years ago
- Classical molecular dynamics proxy application.☆31Updated 4 years ago
- CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as lo…☆30Updated this week
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures☆52Updated last year
- An MPI ABI compatibility layer☆33Updated 3 months ago
- A Task-based Library for Solving Dense Nonsymmetric Eigenvalue Problems☆23Updated 2 years ago