open-mpi / ompi
Open MPI main development repository
☆2,123Updated last week
Related projects: ⓘ
- Official MPICH Repository☆538Updated this week
- Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)☆1,115Updated this week
- Optimized primitives for collective multi-GPU communication☆3,132Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction☆1,855Updated this week
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,669Updated 11 months ago
- Open Fabric Interfaces☆549Updated this week
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆839Updated this week
- A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology☆854Updated 2 months ago
- LAPACK development repository☆1,487Updated this week
- Slurm: A Highly Scalable Workload Manager☆2,579Updated this week
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl☆4,907Updated 7 months ago
- Hardware locality (hwloc)☆563Updated this week
- Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.☆1,215Updated this week
- oneAPI Deep Neural Network Library (oneDNN)☆3,579Updated this week
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.☆6,281Updated last week
- NCCL Tests☆819Updated last month
- oneAPI Math Kernel Library (oneMKL) Interfaces☆606Updated last week
- HPCToolkit performance tools: measurement and analysis components☆330Updated this week
- OpenHPC Integration, Packaging, and Test Repo☆856Updated this week
- Source code examples from the Parallel Forall Blog☆1,223Updated last month
- Collective communications library with various primitives for multi-machine training.☆1,192Updated 2 months ago
- Patterns and behaviors for GPU computing☆1,638Updated 2 years ago
- MPI programming lessons in C and executable code examples☆2,170Updated last month
- Performance monitoring and benchmarking suite☆1,648Updated this week
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl☆2,293Updated 7 months ago
- CUDA Core Compute Libraries☆1,132Updated this week
- Python bindings for MPI☆790Updated this week
- HIP: C++ Heterogeneous-Compute Interface for Portability☆3,690Updated this week
- RDMA core userspace libraries and daemons☆1,487Updated this week
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆528Updated last month