Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceleration.
☆31Jun 26, 2024Updated last year
Alternatives and similar repositories for spla
Users that are interested in spla are comparing it to the libraries listed below
Sorting:
- Porting meshing tools and solvers that deal with unstructured meshes on GPUs☆15Apr 7, 2025Updated 10 months ago
- Base container for developing C++ and Fortran HPC applications☆18Jun 14, 2022Updated 3 years ago
- STREAMer: Benchmarking remote volatile and non-volatile memory bandwidth☆17Aug 21, 2023Updated 2 years ago
- DLA-Future☆83Jan 30, 2026Updated last month
- ☆22Updated this week
- Frame-to-Frame Registration using Gaussian Mixture Models.☆23Mar 2, 2024Updated last year
- Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line☆24Nov 25, 2025Updated 3 months ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆55Jul 25, 2025Updated 7 months ago
- C++17 Wrapper for ScaLAPACK☆11Oct 5, 2023Updated 2 years ago
- OpenMP offload playground☆10Nov 16, 2024Updated last year
- LAPACK++ is a C++ wrapper around CPU and GPU LAPACK and LAPACK-like linear algebra libraries, developed as part of the SLATE project.☆75Oct 22, 2025Updated 4 months ago
- A SCVT mesh generation tool☆13Nov 28, 2020Updated 5 years ago
- Generic exascale-ready library for halo-exchange operations on variety of grids/meshes☆10Dec 23, 2025Updated 2 months ago
- The CSCS ReFrame test suite☆15Updated this week
- Parallel Finite volume moving (Voronoi) mesh MHD code written in Charm++☆15Aug 16, 2012Updated 13 years ago
- Development/testing repo for SWIG+Fortran☆11Mar 25, 2018Updated 7 years ago
- libmpdata++ - a library of parallel MPDATA-based solvers for systems of generalised transport equations☆12Jan 13, 2026Updated last month
- Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.☆13Nov 3, 2023Updated 2 years ago
- An MLIR-based AI compiler designed for Python frontend to RISC-V DSA☆13Oct 10, 2024Updated last year
- Netlib Scalapack with robust CMake☆14Feb 2, 2026Updated 3 weeks ago
- Code accompanying the paper entitled LEVIO: Lightweight Embedded Visual Inertial Odometry for Resource-Constrained Devices☆33Updated this week
- C++ library for graph ordering☆15Mar 20, 2020Updated 5 years ago
- A Multistate Low-dissipation Advection Upstream Splitting Method for Ideal Magnetohydrodynamics / A low-dissipation HLLD approximate Riem…☆13May 17, 2023Updated 2 years ago
- ☆11Feb 20, 2021Updated 5 years ago
- Multiprocessor Algorithms for Nonlinear Gradient-free Optimization☆12Jul 1, 2020Updated 5 years ago
- ☆12Oct 18, 2024Updated last year
- ☆14Sep 22, 2019Updated 6 years ago
- ☆13Aug 4, 2017Updated 8 years ago
- MPI+Kokkos implementation of spectral difference method (SDM) high order schemes☆28Feb 2, 2025Updated last year
- Immersed boundary fractional step method written in FORTRAN 90☆16Aug 14, 2012Updated 13 years ago
- A C++ linear algebra algebra focusing on tensor tree classes designed for quantum dynamics simulations and machine learning applications☆20Apr 16, 2024Updated last year
- Distributed cartesian cell-refinable grid☆13Jun 13, 2025Updated 8 months ago
- Build adaptive meshes from png files.☆13Mar 25, 2025Updated 11 months ago
- DARMA/magistrate => Serialization and checkpointing library☆12Jan 26, 2026Updated last month
- Finite volume operators with anisotropic adaptive mesh refinement on body-fitted structured grids☆14May 30, 2017Updated 8 years ago
- Piernik MHD Code☆15Feb 11, 2026Updated 2 weeks ago
- Userspace eBPF Runtime Benchmarking Test Suite and Results☆16Apr 21, 2024Updated last year
- Massively Asynchronous Coding Environment☆18Oct 21, 2012Updated 13 years ago
- Plane Wave Density Functional Theory Code for the GPU☆12Jan 23, 2015Updated 11 years ago