Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceleration.
☆31Jun 26, 2024Updated last year
Alternatives and similar repositories for spla
Users that are interested in spla are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The CSCS ReFrame test suite☆15Updated this week
- Porting meshing tools and solvers that deal with unstructured meshes on GPUs☆15Apr 21, 2026Updated last week
- ☆22Apr 21, 2026Updated last week
- DLA-Future☆85Mar 27, 2026Updated last month
- Base container for developing C++ and Fortran HPC applications☆18Jun 14, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.☆13Nov 3, 2023Updated 2 years ago
- Domain specific library for electronic structure calculations☆163Updated this week
- Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line☆26Updated this week
- MPI+Kokkos implementation of spectral difference method (SDM) high order schemes☆29Feb 2, 2025Updated last year
- Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs☆16Feb 28, 2019Updated 7 years ago
- Netlib Scalapack with robust CMake☆14Mar 26, 2026Updated last month
- CSCS public documentation☆31Apr 23, 2026Updated last week
- STREAMer: Benchmarking remote volatile and non-volatile memory bandwidth☆18Aug 21, 2023Updated 2 years ago
- Frame-to-Frame Registration using Gaussian Mixture Models.☆23Mar 2, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An expression template based linear algebra library running completely on the GPU using CUDA☆25Jun 24, 2021Updated 4 years ago
- A SCVT mesh generation tool☆13Nov 28, 2020Updated 5 years ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆79Mar 27, 2023Updated 3 years ago
- Recipes for software stacks on Alps vClusters.☆15Updated this week
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆55Jul 25, 2025Updated 9 months ago
- Userspace eBPF Runtime Benchmarking Test Suite and Results☆16Updated this week
- C++17 Wrapper for ScaLAPACK☆11Oct 5, 2023Updated 2 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Apr 2, 2025Updated last year
- Code for Hutch++: Optimal Stochastic Trace Estimation☆13Aug 30, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- An MLIR-based AI compiler designed for Python frontend to RISC-V DSA☆14Oct 10, 2024Updated last year
- C++ library for graph ordering☆15Mar 20, 2020Updated 6 years ago
- A Monte Carlo Neutron Transport Mini-App☆15Apr 15, 2019Updated 7 years ago
- ☆11Aug 8, 2021Updated 4 years ago
- DBCSR: Distributed Block Compressed Sparse Row matrix library☆153Updated this week
- Development/testing repo for SWIG+Fortran☆11Mar 25, 2018Updated 8 years ago
- JIT-compiled GPU kernels for quantum chemistry☆32Jan 30, 2026Updated 3 months ago
- ☆14Oct 8, 2016Updated 9 years ago
- LAPACK++ is a C++ wrapper around CPU and GPU LAPACK and LAPACK-like linear algebra libraries, developed as part of the SLATE project.☆76Oct 22, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- OpenMP offload playground☆10Nov 16, 2024Updated last year
- Generic exascale-ready library for halo-exchange operations on variety of grids/meshes☆10Updated this week
- PyTorch implementation of joint coordinate and sparse parametric encodings for offline RGB-D surface reconstruction☆19May 13, 2023Updated 2 years ago
- Simple small molecular docking and conformation filtering tool.☆13Updated this week
- The Kokkos Fortran Interop repository contains tools and interfaces which help interactions between Fortran portions of an applications a…☆41Mar 12, 2026Updated last month
- GPU-Accelerated multigrid solver for Poisson's equation in 2D☆29Apr 2, 2026Updated last month
- ☆17Dec 10, 2018Updated 7 years ago