High-performance, GPU-aware communication library
☆90 · Dec 16, 2025 · Updated 5 months ago
Alternatives and similar repositories for Aluminum
Users that are interested in Aluminum are comparing it to the libraries listed below.
- High performance NCCL plugin for Bagua. ☆15 · Sep 15, 2021 · Updated 4 years ago
- Comb is a communication performance benchmarking tool. ☆25 · Feb 27, 2023 · Updated 3 years ago
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction ☆72 · Mar 17, 2025 · Updated last year
- Livermore Big Artificial Neural Network Toolkit ☆231 · Apr 8, 2026 · Updated 2 months ago
- Distributed Communication-Optimal Shuffle and Transpose Algorithm ☆14 · Apr 18, 2026 · Updated last month
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs. ☆32 · Apr 2, 2025 · Updated last year
- C++17 Wrapper for ScaLAPACK ☆11 · Oct 5, 2023 · Updated 2 years ago
- A GPU performance prediction toolkit for CUDA programs ☆18 · Mar 25, 2019 · Updated 7 years ago
- SN Application Proxy ☆52 · Jun 22, 2022 · Updated 3 years ago
- CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as lo… ☆31 · Jun 5, 2026 · Updated last week
- Compiler-agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA ☆101 · Jun 4, 2026 · Updated last week
- Pragmatic, Productive, and Portable Affinity for HPC ☆52 · Mar 8, 2026 · Updated 3 months ago
- OCCA Python API: JIT Compilation for Multiple Architectures ☆11 · Dec 20, 2019 · Updated 6 years ago
- High-order Lagrangian Hydrodynamics Miniapp ☆206 · May 30, 2026 · Updated 2 weeks ago
- Near-optimal Prefetching System ☆33 · Nov 17, 2021 · Updated 4 years ago
- Bagua tutorials. ☆13 · Sep 4, 2022 · Updated 3 years ago
- Parallel fast Fourier transforms ☆59 · Jan 8, 2019 · Updated 7 years ago
- DLA-Future ☆85 · Jun 1, 2026 · Updated last week
- Mini-applications that exclusively use the Kokkos programming model ☆12 · Mar 21, 2023 · Updated 3 years ago
- This aims to be a wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i… ☆23 · Oct 11, 2024 · Updated last year
- An application-focused API for memory management on NUMA & GPU architectures ☆404 · Jun 5, 2026 · Updated last week
- Parallel GDB developed for debugging HPC code at Lawrence Livermore National Laboratory. ☆32 · Nov 3, 2015 · Updated 10 years ago
- SST DUMPI Trace Library ☆14 · Apr 24, 2026 · Updated last month
- Portable HPC Containers (C++) ☆49 · May 25, 2026 · Updated 2 weeks ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development. ☆37 · Mar 5, 2026 · Updated 3 months ago
- Large-scale Visualization Data Storage in Python ☆20 · Apr 24, 2026 · Updated last month
- Damselfly Network Simulator ☆10 · Nov 19, 2020 · Updated 5 years ago
- A pseudo random number generator library written against the SYCL API. ☆11 · Jun 11, 2019 · Updated 7 years ago
- Fork of cyclops-community/ctf repository updated haphazardly, previously this was main repo location ☆10 · Aug 7, 2018 · Updated 7 years ago
- A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology ☆1,380 · Mar 12, 2026 · Updated 3 months ago
- A dynamic analysis tool to detect floating-point errors in HPC applications. ☆40 · Jun 6, 2026 · Updated last week
- Unified Collective Communication Library ☆307 · Jun 3, 2026 · Updated last week
- Library for generating C and Fortran bindings for C++ functions from C++ ☆18 · Feb 2, 2021 · Updated 5 years ago
- Kubernetes operator for Bagua distributed training job. ☆13 · Feb 7, 2023 · Updated 3 years ago
- ☆11 · Aug 8, 2021 · Updated 4 years ago
- Multidimensional arrays for C++. (Not an official Boost library.) This is a mirror of gitlab.com/correaa/boost-multi ☆20 · Updated this week
- ☆44 · Jun 3, 2024 · Updated 2 years ago
- Astrophysics MHD simulation code optimized for large clusters of GPUs ☆61 · Dec 20, 2024 · Updated last year
- STREAM, for lots of devices written in many programming models ☆363 · Apr 10, 2026 · Updated 2 months ago