riakymch / exblas
ExBLAS: fast, accurate, and reproducible BLAS
☆13Updated 3 years ago
Alternatives and similar repositories for exblas:
Users that are interested in exblas are comparing it to the libraries listed below
- Recursive LAPACK Collection☆42Updated 3 years ago
- Communication Avoiding Numerical Dense Matrix Computations☆11Updated 4 years ago
- This repository mirrors the principal Gitlab repository of the Chebyshev Accelerated Subspace iteration Eigensolver. If you want to contr…☆16Updated 3 weeks ago
- Julia package for accelerating sparse matrix applications.☆18Updated 2 years ago
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction☆66Updated this week
- PaStiX (Parallel Sparse matriX package) solver library☆13Updated 6 years ago
- Flexible and performant GEMM kernels in Julia☆80Updated 4 months ago
- Linnea is an experimental tool for the automatic generation of optimized code for linear algebra problems.☆68Updated 3 years ago
- Proof of Concept: a C-callable GPU-enabled parallel 2-D heat diffusion solver written in Julia using CUDA, MPI and graphics☆24Updated 4 years ago
- A hierarchical matrix C/C++ library☆23Updated last week
- associative floating point addition☆17Updated 10 months ago
- Error-Free Transformations as building blocks for compensated algorithms☆14Updated 2 years ago
- A Julia interface to ADIOS2☆14Updated last year
- H2 Matrix Package☆29Updated last year
- Programming Gemm Kernels on NVIDIA GPUs with Tensor Cores in Julia☆40Updated 4 months ago
- Custom-Precision Floating-point numbers.☆33Updated 2 months ago
- The MPLAPACK: multiple precision version of BLAS and LAPACK☆87Updated 9 months ago
- ☆62Updated last month
- HiCMA: Hierarchical Computations on Manycore Architectures☆30Updated 2 years ago
- CMake FindLAPACK.cmake that works with Intel MKL, Atlas, OpenBLAS, Netlib, LAPACK95 for C / C++ / Fortran☆15Updated 2 years ago
- TensorOperations and cuTENSOR combined☆13Updated 5 years ago
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆77Updated 2 weeks ago
- C++ Header-Only Library for High-Performance Tensor-Vector Multiplication☆21Updated 3 months ago
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures☆49Updated last year
- Efficient computations with symmetric and non-symmetric tensors with support for automatic differentiation.☆15Updated 7 years ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆110Updated 2 months ago
- Library for chordal matrix computations☆24Updated 6 years ago
- Fast orthogonal polynomial transforms☆61Updated 6 months ago
- ☆26Updated this week
- Sparse symmetric indefinite solver implemented with a runtime system☆13Updated 4 years ago