Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceleration.
☆32 Jun 26, 2024 Updated last year
Alternatives and similar repositories for spla
Users that are interested in spla are comparing it to the libraries listed below.
- Porting meshing tools and solvers that deal with unstructured meshes on GPUs ☆15 Apr 21, 2026 Updated last month
- ☆22 Jun 4, 2026 Updated last week
- DLA-Future ☆85 Jun 1, 2026 Updated last week
- Domain specific library for electronic structure calculations ☆165 Jun 4, 2026 Updated last week
- Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line ☆26 May 17, 2026 Updated 3 weeks ago
- MPI+Kokkos implementation of spectral difference method (SDM) high order schemes ☆29 Feb 2, 2025 Updated last year
- Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs ☆16 Feb 28, 2019 Updated 7 years ago
- Tensor Algebra for many-body methods ☆18 May 23, 2026 Updated 2 weeks ago
- Netlib Scalapack with robust CMake ☆14 Mar 26, 2026 Updated 2 months ago
- STREAMer: Benchmarking remote volatile and non-volatile memory bandwidth ☆18 Aug 21, 2023 Updated 2 years ago
- Frame-to-Frame Registration using Gaussian Mixture Models. ☆23 Mar 2, 2024 Updated 2 years ago
- A SCVT mesh generation tool ☆13 Nov 28, 2020 Updated 5 years ago
- Subset of BLAS routines optimized for NVIDIA GPUs ☆80 Mar 27, 2023 Updated 3 years ago
- Recipes for software stacks on Alps vClusters. ☆15 Updated this week
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support ☆55 Jul 25, 2025 Updated 10 months ago
- Userspace eBPF Runtime Benchmarking Test Suite and Results ☆17 Updated this week
- C++17 Wrapper for ScaLAPACK ☆11 Oct 5, 2023 Updated 2 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs. ☆32 Apr 2, 2025 Updated last year
- Code for Hutch++: Optimal Stochastic Trace Estimation ☆13 Aug 30, 2021 Updated 4 years ago
- An MLIR-based AI compiler designed for Python frontend to RISC-V DSA ☆15 Oct 10, 2024 Updated last year
- C++ library for graph ordering ☆15 Mar 20, 2020 Updated 6 years ago
- A Monte Carlo Neutron Transport Mini-App ☆15 Apr 15, 2019 Updated 7 years ago
- ☆11 Aug 8, 2021 Updated 4 years ago
- DBCSR: Distributed Block Compressed Sparse Row matrix library ☆154 Jun 4, 2026 Updated last week
- JIT-compiled GPU kernels for quantum chemistry ☆34 Jan 30, 2026 Updated 4 months ago
- Development/testing repo for SWIG+Fortran ☆11 Mar 25, 2018 Updated 8 years ago
- ☆14 Oct 8, 2016 Updated 9 years ago
- LAPACK++ is a C++ wrapper around CPU and GPU LAPACK and LAPACK-like linear algebra libraries, developed as part of the SLATE project. ☆76 Oct 22, 2025 Updated 7 months ago
- OpenMP offload playground ☆10 Nov 16, 2024 Updated last year
- Simple small molecular docking and conformation filtering tool. ☆13 Jun 4, 2026 Updated last week
- The Kokkos Fortran Interop repository contains tools and interfaces which help interactions between Fortran portions of an applications a… ☆43 Mar 12, 2026 Updated 2 months ago
- Massively Asynchronous Coding Environment ☆18 Oct 21, 2012 Updated 13 years ago
- GPU-Accelerated multigrid solver for Poisson's equation in 2D ☆29 Apr 2, 2026 Updated 2 months ago
- ☆17 Dec 10, 2018 Updated 7 years ago
- ☆14 Sep 22, 2019 Updated 6 years ago
- High Performance Grouped GEMM in PyTorch ☆30 May 10, 2022 Updated 4 years ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development. ☆37 Mar 5, 2026 Updated 3 months ago
- Double precision raytracer for scientific or engineering applications. ☆12 May 18, 2024 Updated 2 years ago
- DARMA/magistrate => Serialization and checkpointing library ☆12 Jan 26, 2026 Updated 4 months ago