LLNL / AluminumLinks
High-performance, GPU-aware communication library
☆87Updated 5 months ago
Alternatives and similar repositories for Aluminum
Users that are interested in Aluminum are comparing it to the libraries listed below
Sorting:
- gossip: Efficient Communication Primitives for Multi-GPU Systems☆59Updated 2 years ago
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward☆24Updated 7 years ago
- ☆24Updated 4 years ago
- Reference implementations of MLPerf™ HPC training benchmarks☆48Updated 4 months ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆105Updated 7 years ago
- This is a mirror of https://gitlab.inria.fr/starpu/starpu where our development happens, but contributions are welcome here too!☆73Updated this week
- Subset of BLAS routines optimized for NVIDIA GPUs☆69Updated 2 years ago
- Logger for MPI communication☆27Updated last year
- OpenSHMEM Application Programming Interface☆57Updated 7 months ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆58Updated last week
- oneAPI Collective Communications Library (oneCCL)☆237Updated 2 weeks ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆33Updated 2 months ago
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆70Updated 2 months ago
- GPUDirect Async support for IB Verbs☆121Updated 2 years ago
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆108Updated 2 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆119Updated this week
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆90Updated this week
- CUDA Tensor Transpose (cuTT) library☆52Updated 7 years ago
- ☆44Updated 4 years ago
- A hierarchical collective communications library with portable optimizations☆35Updated 6 months ago
- OpenSHMEM Implementation on MPI☆26Updated 3 months ago
- A Micro-benchmarking Tool for HPC Networks☆31Updated 5 months ago
- Advanced Profiling and Analytics for AMD Hardware☆157Updated this week
- A light-weight MPI profiler.☆95Updated 11 months ago
- RAJA Performance Suite☆117Updated this week
- Autonomic Performance Environment for eXascale (APEX)☆48Updated last month
- Distributed View Extension for Kokkos☆46Updated 6 months ago
- A task benchmark☆43Updated 10 months ago
- ROCm SPARSE marshalling library☆67Updated this week
- Integrated Performance Monitoring for High Performance Computing☆89Updated 3 years ago