open-mpi / ompiLinks
Open MPI main development repository
☆2,410Updated last week
Alternatives and similar repositories for ompi
Users that are interested in ompi are comparing it to the libraries listed below
Sorting:
- Official MPICH Repository☆633Updated last week
- Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)☆1,452Updated this week
- Optimized primitives for collective multi-GPU communication☆4,051Updated this week
- A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology☆1,213Updated last month
- Slurm: A Highly Scalable Workload Manager☆3,289Updated this week
- Open Fabric Interfaces☆701Updated last week
- RDMA core userspace libraries and daemons☆1,958Updated this week
- MPI programming lessons in C and executable code examples☆2,299Updated 2 months ago
- Hardware locality (hwloc)☆649Updated this week
- OpenHPC Integration, Packaging, and Test Repo☆933Updated this week
- Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.☆1,365Updated this week
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,786Updated last year
- Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction☆2,323Updated this week
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl☆4,986Updated last year
- LAPACK development repository☆1,712Updated this week
- Official HDF5® Library Repository☆794Updated this week
- oneAPI Math Library (oneMath)☆714Updated last month
- HPCToolkit performance tools: measurement and analysis components☆344Updated 7 months ago
- CUDA Core Compute Libraries☆1,913Updated this week
- Collective communications library with various primitives for multi-machine training.☆1,353Updated last week
- oneAPI Deep Neural Network Library (oneDNN)☆3,884Updated last week
- Performance monitoring and benchmarking suite☆1,822Updated this week
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆900Updated last week
- Source code examples from the Parallel Forall Blog☆1,302Updated last year
- ☆171Updated 2 weeks ago
- NCCL Tests☆1,265Updated 2 weeks ago
- This is a set of simple programs that can be used to explore the features of a parallel platform.☆455Updated 3 weeks ago
- An HPC workload manager and job scheduler for desktops, clusters, and clouds.☆759Updated 2 weeks ago
- Official HPCG benchmark source code☆327Updated last year
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs☆1,323Updated 5 months ago