Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceleration.
☆32Jun 26, 2024Updated last year
Alternatives and similar repositories for spla
Users that are interested in spla are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The CSCS ReFrame test suite☆15Updated this week
- Porting meshing tools and solvers that deal with unstructured meshes on GPUs☆15Apr 21, 2026Updated last month
- ☆22Updated this week
- DLA-Future☆85Mar 27, 2026Updated last month
- Base container for developing C++ and Fortran HPC applications☆18Jun 14, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.☆13Nov 3, 2023Updated 2 years ago
- MPI+Kokkos implementation of spectral difference method (SDM) high order schemes☆29Feb 2, 2025Updated last year
- Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs☆16Feb 28, 2019Updated 7 years ago
- Netlib Scalapack with robust CMake☆14Mar 26, 2026Updated last month
- CSCS public documentation☆33Updated this week
- STREAMer: Benchmarking remote volatile and non-volatile memory bandwidth☆18Aug 21, 2023Updated 2 years ago
- Frame-to-Frame Registration using Gaussian Mixture Models.☆23Mar 2, 2024Updated 2 years ago
- An expression template based linear algebra library running completely on the GPU using CUDA☆25Jun 24, 2021Updated 4 years ago
- A SCVT mesh generation tool☆13Nov 28, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Subset of BLAS routines optimized for NVIDIA GPUs☆80Mar 27, 2023Updated 3 years ago
- Recipes for software stacks on Alps vClusters.☆15May 13, 2026Updated last week
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆55Jul 25, 2025Updated 9 months ago
- Userspace eBPF Runtime Benchmarking Test Suite and Results☆16Updated this week
- C++17 Wrapper for ScaLAPACK☆11Oct 5, 2023Updated 2 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Apr 2, 2025Updated last year
- An MLIR-based AI compiler designed for Python frontend to RISC-V DSA☆14Oct 10, 2024Updated last year
- C++ library for graph ordering☆15Mar 20, 2020Updated 6 years ago
- A Monte Carlo Neutron Transport Mini-App☆15Apr 15, 2019Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆11Aug 8, 2021Updated 4 years ago
- DBCSR: Distributed Block Compressed Sparse Row matrix library☆153Apr 28, 2026Updated 3 weeks ago
- JIT-compiled GPU kernels for quantum chemistry☆33Jan 30, 2026Updated 3 months ago
- Development/testing repo for SWIG+Fortran☆11Mar 25, 2018Updated 8 years ago
- ☆14Oct 8, 2016Updated 9 years ago
- LAPACK++ is a C++ wrapper around CPU and GPU LAPACK and LAPACK-like linear algebra libraries, developed as part of the SLATE project.☆76Oct 22, 2025Updated 7 months ago
- OpenMP offload playground☆10Nov 16, 2024Updated last year
- Generic exascale-ready library for halo-exchange operations on variety of grids/meshes☆10May 13, 2026Updated last week
- PyTorch implementation of joint coordinate and sparse parametric encodings for offline RGB-D surface reconstruction☆19May 13, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Simple small molecular docking and conformation filtering tool.☆13Updated this week
- The Kokkos Fortran Interop repository contains tools and interfaces which help interactions between Fortran portions of an applications a…☆43Mar 12, 2026Updated 2 months ago
- Massively Asynchronous Coding Environment☆18Oct 21, 2012Updated 13 years ago
- GPU-Accelerated multigrid solver for Poisson's equation in 2D☆29Apr 2, 2026Updated last month
- ☆17Dec 10, 2018Updated 7 years ago
- High Performance Grouped GEMM in PyTorch☆30May 10, 2022Updated 4 years ago
- Multiprocessor Algorithms for Nonlinear Gradient-free Optimization☆12Jul 1, 2020Updated 5 years ago