eth-cscs / conflux
Distributed Communication-Optimal LU-factorization Algorithm
☆12Updated 3 years ago
Alternatives and similar repositories for conflux:
Users that are interested in conflux are comparing it to the libraries listed below
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆30Updated 2 months ago
- HiCMA: Hierarchical Computations on Manycore Architectures☆30Updated last year
- ☆17Updated last year
- MPI accelerator-integrated communication extensions☆32Updated last year
- Comb is a communication performance benchmarking tool.☆24Updated last year
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆39Updated last year
- ☆20Updated 2 months ago
- MagmaDNN: a simple deep learning framework in c++☆49Updated 4 years ago
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot☆14Updated 5 years ago
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward☆21Updated 6 years ago
- ☆11Updated this week
- Next generation library for iterative sparse solvers for ROCm platform☆78Updated this week
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data☆31Updated 3 months ago
- ☆15Updated 3 years ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 4 months ago
- Distributed View Extension for Kokkos☆44Updated 2 months ago
- Error-Free Transformations as building blocks for compensated algorithms☆14Updated last year
- Autonomic Performance Environment for eXascale (APEX)☆43Updated this week
- A task benchmark☆41Updated 6 months ago
- RAJA Performance Suite☆118Updated this week
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆49Updated this week
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code☆39Updated last month
- MiniAMR Adaptive Mesh Refinement (AMR) Mini-App☆33Updated 3 months ago
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri…☆25Updated 2 weeks ago
- DLA-Future☆69Updated this week
- HPCG benchmark based on ROCm platform☆36Updated 3 weeks ago
- A unified framework across multiple programming platforms☆36Updated 7 months ago
- Highly Efficient FFT for Exascale☆37Updated 9 months ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆105Updated last year
- sparse matrix pre-processing library☆81Updated 9 months ago