Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceleration.
☆31 · Jun 26, 2024 · Updated last year
Alternatives and similar repositories for spla
Users that are interested in spla are comparing it to the libraries listed below.
- The CSCS ReFrame test suite ☆15 · Apr 1, 2026 · Updated last week
- Porting meshing tools and solvers that deal with unstructured meshes on GPUs ☆15 · Mar 12, 2026 · Updated 3 weeks ago
- ☆22 · Apr 1, 2026 · Updated last week
- DLA-Future ☆85 · Mar 27, 2026 · Updated 2 weeks ago
- Base container for developing C++ and Fortran HPC applications ☆18 · Jun 14, 2022 · Updated 3 years ago
- Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction. ☆13 · Nov 3, 2023 · Updated 2 years ago
- Domain specific library for electronic structure calculations ☆163 · Apr 2, 2026 · Updated last week
- Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line ☆25 · Mar 31, 2026 · Updated last week
- Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs ☆16 · Feb 28, 2019 · Updated 7 years ago
- Netlib Scalapack with robust CMake ☆14 · Mar 26, 2026 · Updated 2 weeks ago
- CSCS public documentation ☆30 · Updated this week
- Frame-to-Frame Registration using Gaussian Mixture Models. ☆23 · Mar 2, 2024 · Updated 2 years ago
- An expression template based linear algebra library running completely on the GPU using CUDA ☆25 · Jun 24, 2021 · Updated 4 years ago
- A SCVT mesh generation tool ☆13 · Nov 28, 2020 · Updated 5 years ago
- Subset of BLAS routines optimized for NVIDIA GPUs ☆78 · Mar 27, 2023 · Updated 3 years ago
- Recipes for software stacks on Alps vClusters. ☆15 · Updated this week
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support ☆55 · Jul 25, 2025 · Updated 8 months ago
- Userspace eBPF Runtime Benchmarking Test Suite and Results ☆16 · Updated this week
- C++17 Wrapper for ScaLAPACK ☆11 · Oct 5, 2023 · Updated 2 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs. ☆32 · Apr 2, 2025 · Updated last year
- C++ library for graph ordering ☆15 · Mar 20, 2020 · Updated 6 years ago
- A Monte Carlo Neutron Transport Mini-App ☆15 · Apr 15, 2019 · Updated 6 years ago
- DBCSR: Distributed Block Compressed Sparse Row matrix library ☆153 · Updated this week
- ☆11 · Aug 8, 2021 · Updated 4 years ago
- Development/testing repo for SWIG+Fortran ☆11 · Mar 25, 2018 · Updated 8 years ago
- JIT-compiled GPU kernels for quantum chemistry ☆32 · Jan 30, 2026 · Updated 2 months ago
- ☆14 · Oct 8, 2016 · Updated 9 years ago
- LAPACK++ is a C++ wrapper around CPU and GPU LAPACK and LAPACK-like linear algebra libraries, developed as part of the SLATE project. ☆76 · Oct 22, 2025 · Updated 5 months ago
- OpenMP offload playground ☆10 · Nov 16, 2024 · Updated last year
- PyTorch implementation of joint coordinate and sparse parametric encodings for offline RGB-D surface reconstruction ☆19 · May 13, 2023 · Updated 2 years ago
- Simple small molecular docking and conformation filtering tool. ☆13 · Updated this week
- The Kokkos Fortran Interop repository contains tools and interfaces which help interactions between Fortran portions of an applications a… ☆40 · Mar 12, 2026 · Updated 3 weeks ago
- Massively Asynchronous Coding Environment ☆18 · Oct 21, 2012 · Updated 13 years ago
- GPU-Accelerated multigrid solver for Poisson's equation in 2D ☆29 · Apr 2, 2026 · Updated last week
- ☆14 · Sep 22, 2019 · Updated 6 years ago
- Multiprocessor Algorithms for Nonlinear Gradient-free Optimization ☆12 · Jul 1, 2020 · Updated 5 years ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆213 · Mar 25, 2026 · Updated 2 weeks ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development. ☆37 · Mar 5, 2026 · Updated last month
- Double precision raytracer for scientific or engineering applications. ☆12 · May 18, 2024 · Updated last year