open-mpi / ompiLinks
Open MPI main development repository
☆2,454Updated this week
Alternatives and similar repositories for ompi
Users that are interested in ompi are comparing it to the libraries listed below
Sorting:
- Official MPICH Repository☆642Updated this week
- Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)☆1,508Updated this week
- Optimized primitives for collective multi-GPU communication☆4,236Updated last week
- oneAPI Deep Neural Network Library (oneDNN)☆3,921Updated this week
- A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology☆1,276Updated 3 months ago
- Open Fabric Interfaces☆727Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction☆2,369Updated this week
- MPI programming lessons in C and executable code examples☆2,316Updated 2 months ago
- oneAPI Math Library (oneMath)☆724Updated 2 weeks ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,806Updated 2 years ago
- OpenHPC Integration, Packaging, and Test Repo☆947Updated last week
- LAPACK development repository☆1,746Updated last week
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆921Updated this week
- Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.☆1,387Updated this week
- RDMA core userspace libraries and daemons☆2,028Updated last week
- Hardware locality (hwloc)☆661Updated last week
- Official HDF5® Library Repository☆838Updated last week
- Slurm: A Highly Scalable Workload Manager☆3,436Updated this week
- HPCToolkit performance tools: measurement and analysis components☆344Updated 9 months ago
- Source code examples from the Parallel Forall Blog☆1,313Updated last month
- Collective communications library with various primitives for multi-machine training.☆1,369Updated last month
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl☆4,987Updated last year
- Official HPCG benchmark source code☆328Updated last year
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs☆1,331Updated 7 months ago
- NUMA support for Linux☆474Updated 2 weeks ago
- An HPC workload manager and job scheduler for desktops, clusters, and clouds.☆772Updated last week
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.☆7,112Updated 2 weeks ago
- NCCL Tests☆1,338Updated 2 weeks ago
- LaTeX Examples Document Source☆251Updated last week
- Patterns and behaviors for GPU computing☆1,746Updated 3 years ago