eth-cscs/COSMA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/eth-cscs/COSMA)

eth-cscs / COSMA

Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm

☆215

Alternatives and similar repositories for COSMA

Users that are interested in COSMA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

eth-cscs / Tiled-MM
View on GitHub
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.
☆33Apr 2, 2025Updated last year
eth-cscs / COSTA
View on GitHub
Distributed Communication-Optimal Shuffle and Transpose Algorithm
☆14Apr 18, 2026Updated 3 months ago
eth-cscs / SpFFT
View on GitHub
Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support
☆55Jul 25, 2025Updated 11 months ago
eth-cscs / conflux
View on GitHub
Distributed Communication-Optimal LU-factorization Algorithm
☆12Aug 1, 2021Updated 4 years ago
cp2k / dbcsr
View on GitHub
DBCSR: Distributed Block Compressed Sparse Row matrix library
☆155Updated this week
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
eth-cscs / DLA-Future
View on GitHub
DLA-Future
☆85Jun 19, 2026Updated last month
ecrc / hicma
View on GitHub
HiCMA: Hierarchical Computations on Manycore Architectures
☆37Mar 19, 2023Updated 3 years ago
electronic-structure / SIRIUS
View on GitHub
Domain specific library for electronic structure calculations
☆167Updated this week
eth-cscs / spla
View on GitHub
Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…
☆32Jun 26, 2024Updated 2 years ago
libxsmm / libxsmm
View on GitHub
Library for specialized dense and sparse matrix operations, and deep learning primitives.
☆968Updated this week
eth-cscs / cscs-reframe-tests
View on GitHub
The CSCS ReFrame test suite
☆15Updated this week
AnonymousYWL / MYLIB
View on GitHub
☆18Apr 8, 2022Updated 4 years ago
Reference-ScaLAPACK / scalapack
View on GitHub
ScaLAPACK development repository
☆169Jul 10, 2026Updated last week
llnl / Umpire
View on GitHub
An application-focused API for memory management on NUMA & GPU architectures
☆416Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
spcl / dace
View on GitHub
DaCe - Data Centric Parallel Programming
☆591Updated this week
spcl / absinthe
View on GitHub
Absinthe is an optimization framework to fuse and tile stencil codes in one shot
☆14Jul 17, 2019Updated 7 years ago
ConnollyLeon / awesome-Auto-Parallelism
View on GitHub
A baseline repository of Auto-Parallelism in Training Neural Networks
☆145Jun 25, 2022Updated 4 years ago
reframe-hpc / reframe
View on GitHub
A powerful Python framework for writing and running portable regression tests and benchmarks for HPC systems.
☆279Updated this week
ginkgo-project / ginkgo
View on GitHub
Numerical linear algebra software package
☆611Updated this week
eth-cscs / pyfirecrest
View on GitHub
Python wrappers for the FirecREST API
☆12Jun 21, 2026Updated last month
deep500 / deep500
View on GitHub
A Deep Learning Meta-Framework and HPC Benchmarking Library
☆81May 23, 2022Updated 4 years ago
jeffhammond / HPCInfo
View on GitHub
Information about many aspects of high-performance computing. Wiki content moved to ~/docs.
☆317Jun 13, 2026Updated last month
spcl / FBLAS
View on GitHub
BLAS implementation for Intel FPGA
☆78Nov 18, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
eth-cscs / stackinator
View on GitHub
☆23Jul 10, 2026Updated last week
ParRes / Kernels
View on GitHub
This is a set of simple programs that can be used to explore the features of a parallel platform.
☆475Jan 27, 2026Updated 5 months ago
hpc / pavilion2
View on GitHub
Pavilion is a Python 3 (3.6+) based framework for running and analyzing tests targeting HPC systems.
☆46Updated this week
nomad-coe / greenX
View on GitHub
Library for Green’s function based electronic structure theory calculations
☆28Apr 16, 2026Updated 3 months ago
GridTools / gt4py
View on GitHub
Python library for generating high-performance implementations of stencil kernels for weather and climate modeling from a domain-specific…
☆147Updated this week
AMDResearch / DAGEE
View on GitHub
Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…
☆49Oct 12, 2021Updated 4 years ago
eth-cscs / ext_mpi_collectives
View on GitHub
ext_mpi_collectives
☆11Jun 3, 2026Updated last month
ROCm / Tensile
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆260Updated this week
NVIDIA / multi-gpu-programming-models
View on GitHub
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
☆909Sep 26, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
at-aaims / OpenMxP
View on GitHub
This is the open source version of HPL-MXP. The code performance has been verified on Frontier
☆18Jul 9, 2025Updated last year
openucx / ucc
View on GitHub
Unified Collective Communication Library
☆310Updated this week
cyclops-community / ctf
View on GitHub
Cyclops Tensor Framework: parallel arithmetic on multidimensional arrays
☆216Jun 11, 2025Updated last year
icl-utk-edu / slate
View on GitHub
SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…
☆135Oct 21, 2025Updated 9 months ago
UoB-HPC / minifmm
View on GitHub
☆11Aug 8, 2021Updated 4 years ago
UoB-HPC / BabelStream
View on GitHub
STREAM, for lots of devices written in many programming models
☆368Jun 15, 2026Updated last month
ROCm / hipFFT
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆65Updated this week