This is a plugin that lets EC2 developers use libfabric as a network provider while running NCCL applications.
☆205 · Feb 27, 2026 · Updated last week
Alternatives and similar repositories for aws-ofi-nccl
Users that are interested in aws-ofi-nccl are comparing it to the libraries listed below
- Open Fabric Interfaces ☆764 · Updated this week
- NCCL Profiling Kit ☆152 · Jul 1, 2024 · Updated last year
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers ☆44 · Sep 19, 2023 · Updated 2 years ago
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud. ☆122 · Nov 15, 2023 · Updated 2 years ago
- RDMA core userspace libraries and daemons ☆15 · Feb 16, 2026 · Updated 2 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆410 · Updated this week
- AWS Libfabric ☆45 · Jan 29, 2026 · Updated last month
- Microsoft Collective Communication Library ☆385 · Sep 20, 2023 · Updated 2 years ago
- RDMA and SHARP plugins for nccl library ☆224 · Jan 12, 2026 · Updated last month
- A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology ☆1,347 · Dec 17, 2025 · Updated 2 months ago
- NCCL Tests ☆1,446 · Feb 9, 2026 · Updated 3 weeks ago
- Optimized primitives for collective multi-GPU communication ☆4,495 · Updated this week
- Unified Collective Communication Library ☆293 · Feb 27, 2026 · Updated last week
- Linux Cross-Memory Attach ☆97 · Feb 18, 2026 · Updated 2 weeks ago
- ☆17 · Nov 11, 2025 · Updated 3 months ago
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric … ☆78 · Feb 24, 2026 · Updated last week
- ☆387 · Apr 23, 2024 · Updated last year
- Reference implementations of MLPerf™ HPC training benchmarks ☆50 · Feb 25, 2025 · Updated last year
- Infiniband Verbs Performance Tests ☆919 · Updated this week
- Deploying EFA in EKS utilizing GPUDirectRDMA where supported ☆36 · Oct 15, 2024 · Updated last year
- Synthesizer for optimal collective communication algorithms ☆124 · Apr 8, 2024 · Updated last year
- Benchmarks ☆18 · Apr 28, 2025 · Updated 10 months ago
- ☆70 · Feb 10, 2025 · Updated last year
- Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group) ☆1,583 · Updated this week
- NVIDIA Inference Xfer Library (NIXL) ☆898 · Feb 28, 2026 · Updated last week
- ☆26 · May 19, 2021 · Updated 4 years ago
- ☆11 · Feb 17, 2026 · Updated 2 weeks ago
- A continuous integration (CI) system for 📓 Jupyter notebooks, built using 🧠 Amazon SageMaker. ☆11 · Aug 5, 2025 · Updated 7 months ago
- AWS Slurm Cluster for EDA Workloads ☆27 · Aug 20, 2025 · Updated 6 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆165 · Feb 16, 2026 · Updated 2 weeks ago
- Repository linking to the software artifacts used for the MigrOS ATC 2021 paper ☆18 · May 31, 2021 · Updated 4 years ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data ☆34 · Jan 30, 2026 · Updated last month
- ☆57 · Dec 12, 2025 · Updated 2 months ago
- The Singularity SPANK plugin provides the users with an interface to launch an application within a Linux container. ☆11 · Nov 4, 2025 · Updated 4 months ago
- Oak Ridge OpenSHMEM Benchmarks ☆15 · Jun 26, 2018 · Updated 7 years ago
- GPUDirect Async implementation of HPGMG-FV CUDA ☆11 · May 11, 2018 · Updated 7 years ago
- Prometheus collector and exporter for Slurm cluster metrics. A Slinky project. ☆16 · Nov 7, 2025 · Updated 3 months ago
- Damselfly Network Simulator ☆10 · Nov 19, 2020 · Updated 5 years ago
- Build scripts for PyTorch @ NERSC ☆12 · Dec 27, 2025 · Updated 2 months ago