ecrc / polarLinks
Distributed-memory, double-precision, polar decomposition (QDWH/ZOLO-PD) of a dense matrix, svd (QDWH/ZOLOPD-SVD) of a dense matrix
☆13Updated 5 years ago
Alternatives and similar repositories for polar
Users that are interested in polar are comparing it to the libraries listed below
Sorting:
- A C++ library for computing large scale tensor contractions.☆38Updated 7 years ago
- Cyclops Tensor Framework: parallel arithmetic on multidimensional arrays☆206Updated 2 months ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆55Updated 2 weeks ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆125Updated 2 months ago
- High-Performance Machine Learning Primitives☆12Updated 4 years ago
- Multiresolution Adaptive Numerical Environment for Scientific Simulation☆207Updated 3 weeks ago
- Data parallel C++ mathematical object library☆163Updated 3 weeks ago
- Implementation of MPI that supports large counts☆48Updated 8 months ago
- QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experi…☆27Updated last year
- DBCSR: Distributed Block Compressed Sparse Row matrix library☆144Updated last week
- A massively-parallel, block-sparse tensor framework written in C++☆303Updated last week
- A Massively Parallel FFT Library for CPU/GPU☆56Updated 4 years ago
- Tensor Contraction Code Generator☆38Updated 7 years ago
- Tensor Algebra Library Routines for Shared Memory Systems☆38Updated last year
- This aims to be an wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆23Updated 10 months ago
- ☆67Updated last week
- Molecular dynamics proxy application based on Kokkos☆34Updated last year
- TBLIS is a library and framework for performing tensor operations, especially tensor contraction, using efficient native algorithms.☆129Updated 2 weeks ago
- Distributed Communication-Optimal Shuffle and Transpose Algorithm☆14Updated 3 months ago
- Training examples for SYCL☆49Updated last week
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core …☆68Updated 2 months ago
- Header-only plugin for the Google Test framework defining listener(s) emitting sensible output when testing MPI-based, distributed-memory…☆21Updated 4 years ago
- GEMMul8 (GEMMulate): GEMM emulation using int8 matrix engines based on the Ozaki Scheme II☆22Updated last week
- directory for randomized cholesky☆18Updated 4 years ago
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction☆70Updated 4 months ago
- ☆15Updated 4 years ago
- Structured Matrix Package (LBNL)☆176Updated this week
- H2Lib public repository☆58Updated 2 years ago
- Tensor Contraction C++ Library☆53Updated 5 years ago
- DLA-Future☆77Updated this week