☆25May 19, 2021Updated 4 years ago
Alternatives and similar repositories for xccl
Users that are interested in xccl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch process group third-party plugin for UCC☆21Apr 15, 2024Updated last year
- RDMA and SHARP plugins for nccl library☆224Updated this week
- Unified Collective Communication Library☆297Updated this week
- pytorch ucc plugin☆23Jul 8, 2021Updated 4 years ago
- verbs profiling library☆22Sep 22, 2023Updated 2 years ago
- gossip: Efficient Communication Primitives for Multi-GPU Systems☆62Jul 1, 2022Updated 3 years ago
- GPUDirect Async support for IB Verbs☆136Nov 10, 2022Updated 3 years ago
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.☆122Nov 15, 2023Updated 2 years ago
- ☆391Apr 23, 2024Updated last year
- RDMA core userspace libraries and daemons☆15Updated this week
- rdma编程学习☆25Dec 6, 2021Updated 4 years ago
- An HPL-AI implementation for Fugaku☆23Jun 29, 2021Updated 4 years ago
- NCCL Profiling Kit☆152Jul 1, 2024Updated last year
- Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)☆1,596Mar 15, 2026Updated last week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆145Updated this week
- OFED libibverbs tests package☆17Oct 5, 2021Updated 4 years ago
- A Golang Kubernetes client☆13Jun 27, 2025Updated 8 months ago
- Research Computing Framework Based on Singularity and Lmod☆10Aug 22, 2020Updated 5 years ago
- Infiniband verbs performance tests (fork of git://git.openfabrics.org/~grockah/perftest.git)☆20Jan 14, 2016Updated 10 years ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆413Updated this week
- K8s Multi- network service controller☆15Jun 18, 2019Updated 6 years ago
- ext_mpi_collectives☆11Apr 1, 2025Updated 11 months ago
- Sample Codes using NVSHMEM on Multi-GPU☆30Jan 22, 2023Updated 3 years ago
- This repository is now stale. You should be looking at the open-mpi/ompi repository instead.☆34Sep 21, 2016Updated 9 years ago
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆60Updated this week
- A low-level transport Linux kernel module for bulk low-latency data transfers between two SoCs over PCIe NTB☆19May 2, 2023Updated 2 years ago
- Synthesizer for optimal collective communication algorithms☆123Apr 8, 2024Updated last year
- Thunder Research Group's Collective Communication Library☆50Jul 8, 2025Updated 8 months ago
- An Open-Source Community Supported Fortran layer for AMD HIP☆10May 20, 2020Updated 5 years ago
- A kernel module to enable RDMA transfers to/from IO, PFN and DAX mapped memory☆10Jun 23, 2015Updated 10 years ago
- Linux Cross-Memory Attach☆97Feb 18, 2026Updated last month
- The operator manages the ovn-kube components running on the DPU card for enabling OVS hardware offloading.☆28Feb 13, 2026Updated last month
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆208Updated this week
- Microsoft Collective Communication Library☆387Sep 20, 2023Updated 2 years ago
- GPUDirect Async implementation of HPGMG-FV CUDA☆11May 11, 2018Updated 7 years ago
- A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology☆1,355Mar 12, 2026Updated last week
- MemEC: An Erasure-Coding-Based Distributed In-Memory Key-Value Store☆11Mar 30, 2017Updated 8 years ago
- Watts Up? Pro/.Net meter logger☆11Aug 10, 2021Updated 4 years ago
- This version of Chombo is fortran-free and depends on the Proto middleware infrastructure for performance portability.☆10Sep 12, 2025Updated 6 months ago