aws / aws-ofi-nccl
This is a plugin that lets EC2 developers use libfabric as a network provider while running NCCL applications.
☆204 Feb 6, 2026 Updated last week
Alternatives and similar repositories for aws-ofi-nccl
Users who are interested in aws-ofi-nccl are comparing it to the libraries listed below.
- Open Fabric Interfaces ☆759 Updated this week
- NCCL Profiling Kit ☆152 Jul 1, 2024 Updated last year
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers ☆43 Sep 19, 2023 Updated 2 years ago
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud. ☆122 Nov 15, 2023 Updated 2 years ago
- RDMA core userspace libraries and daemons ☆15 Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆410 Feb 7, 2026 Updated last week
- AWS Libfabric ☆45 Jan 29, 2026 Updated 2 weeks ago
- RDMA and SHARP plugins for nccl library ☆223 Jan 12, 2026 Updated last month
- A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology ☆1,339 Dec 17, 2025 Updated last month
- NCCL Tests ☆1,427 Updated this week
- Optimized primitives for collective multi-GPU communication ☆4,436 Feb 3, 2026 Updated last week
- Unified Collective Communication Library ☆291 Jan 30, 2026 Updated 2 weeks ago
- Linux Cross-Memory Attach ☆96 Sep 11, 2024 Updated last year
- ☆17 Nov 11, 2025 Updated 3 months ago
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric … ☆77 Aug 27, 2025 Updated 5 months ago
- Reference implementations of MLPerf™ HPC training benchmarks ☆49 Feb 25, 2025 Updated 11 months ago
- Infiniband Verbs Performance Tests ☆910 Jan 11, 2026 Updated last month
- Synthesizer for optimal collective communication algorithms ☆124 Apr 8, 2024 Updated last year
- ☆71 Feb 10, 2025 Updated last year
- Benchmarks ☆17 Apr 28, 2025 Updated 9 months ago
- Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group) ☆1,573 Updated this week
- NVIDIA Inference Xfer Library (NIXL) ☆876 Updated this week
- ☆26 May 19, 2021 Updated 4 years ago
- A continuous integration (CI) system for 📓 Jupyter notebooks, built using 🧠 Amazon SageMaker. ☆11 Aug 5, 2025 Updated 6 months ago
- ☆11 Jan 21, 2026 Updated 3 weeks ago
- Prometheus collector and exporter for Slurm cluster metrics. A Slinky project. ☆15 Nov 7, 2025 Updated 3 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆165 Feb 3, 2026 Updated last week
- Repository linking to the software artifacts used for the MigrOS ATC 2021 paper ☆18 May 31, 2021 Updated 4 years ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data ☆34 Jan 30, 2026 Updated 2 weeks ago
- ☆56 Dec 12, 2025 Updated 2 months ago
- GPUDirect Async implementation of HPGMG-FV CUDA ☆11 May 11, 2018 Updated 7 years ago
- Damselfly Network Simulator ☆10 Nov 19, 2020 Updated 5 years ago
- Build scripts for PyTorch @ NERSC ☆12 Dec 27, 2025 Updated last month
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note… ☆64 Jun 30, 2025 Updated 7 months ago
- NVIDIA NCCL Tests for Distributed Training ☆136 Jan 27, 2026 Updated 2 weeks ago
- Collective communications library with various primitives for multi-machine training. ☆1,396 Updated this week
- verbs profiling library ☆22 Sep 22, 2023 Updated 2 years ago
- Integrated Performance Monitoring for High Performance Computing ☆91 Nov 5, 2021 Updated 4 years ago
- ☆15 Apr 21, 2025 Updated 9 months ago