flame / tblis-strassenLinks
Strassen's Algorithm for Tensor Contraction
☆13Updated 8 years ago
Alternatives and similar repositories for tblis-strassen
Users that are interested in tblis-strassen are comparing it to the libraries listed below
Sorting:
- Tensor Contraction Code Generator☆39Updated 8 years ago
- QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experi…☆27Updated last year
- HiCMA: Hierarchical Computations on Manycore Architectures☆32Updated 2 years ago
- Cyclops Tensor Framework: parallel arithmetic on multidimensional arrays☆208Updated 4 months ago
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri…☆31Updated this week
- C++ library for tensor computations☆35Updated 2 years ago
- The Surprisingly ParalleL spArse Tensor Toolkit.☆73Updated 3 years ago
- Performance engineering for the rest of us.☆31Updated 3 weeks ago
- sparse matrix pre-processing library☆83Updated last year
- Basic Tensor Algebra Subroutines☆48Updated 2 months ago
- DLA-Future☆79Updated this week
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆55Updated 3 months ago
- Classical molecular dynamics proxy application.☆32Updated 5 years ago
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆79Updated 2 months ago
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte…☆25Updated 4 months ago
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction☆70Updated 7 months ago
- Tensor Algebra Library Routines for Shared Memory Systems☆38Updated last year
- Partitioned Global Address Space (PGAS) library for distributed arrays☆107Updated last week
- ☆35Updated 5 years ago
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multiple…☆37Updated 4 years ago
- A BUDE virtual-screening benchmark, in many programming models☆29Updated last year
- C++ Header-Only Library for High-Performance Tensor-Vector Multiplication☆22Updated 10 months ago
- High-Performance Tensor Transpose library☆205Updated 2 years ago
- Compute applications.☆25Updated 5 years ago
- Library for exact linear algebra, a C++ template-library based originally on LinBox intended for F4-like implementations☆18Updated 12 years ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆45Updated last year
- TTC: A high-performance Compiler for Tensor Transpositions☆21Updated 8 years ago
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures☆54Updated 2 years ago
- Recursive LAPACK Collection☆44Updated 3 years ago
- CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as lo…☆31Updated last week