open-mpi / ompiLinks
Open MPI main development repository
☆2,499Updated this week
Alternatives and similar repositories for ompi
Users that are interested in ompi are comparing it to the libraries listed below
Sorting:
- Official MPICH Repository☆650Updated 2 weeks ago
- Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)☆1,538Updated last week
- Slurm: A Highly Scalable Workload Manager☆3,621Updated this week
- Optimized primitives for collective multi-GPU communication☆4,352Updated last week
- MPI programming lessons in C and executable code examples☆2,327Updated last week
- A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology☆1,312Updated 2 weeks ago
- OpenHPC Integration, Packaging, and Test Repo☆957Updated last week
- LAPACK development repository☆1,775Updated 2 weeks ago
- Open Fabric Interfaces☆745Updated this week
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,812Updated 2 years ago
- RDMA core userspace libraries and daemons☆2,098Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction☆2,414Updated this week
- Hardware locality (hwloc)☆669Updated 2 weeks ago
- Collective communications library with various primitives for multi-machine training.☆1,384Updated last month
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆929Updated this week
- Official HDF5® Library Repository☆868Updated this week
- oneAPI Math Library (oneMath)☆737Updated 3 weeks ago
- NCCL Tests☆1,389Updated last week
- An HPC workload manager and job scheduler for desktops, clusters, and clouds.☆780Updated last month
- CUDA Core Compute Libraries☆2,102Updated this week
- Source code examples from the Parallel Forall Blog☆1,314Updated 3 months ago
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl☆4,999Updated last year
- Official HPCG benchmark source code☆334Updated last year
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆850Updated 3 months ago
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs☆1,342Updated 8 months ago
- HPCToolkit performance tools: measurement and analysis components☆344Updated 10 months ago
- Reference implementations of MLPerf® training benchmarks☆1,736Updated 3 weeks ago
- BLAS-like Library Instantiation Software Framework☆2,582Updated last month
- CUDA Library Samples☆2,265Updated 2 weeks ago
- HPC Container Maker☆502Updated 2 weeks ago