☆24 · Feb 12, 2025 · Updated last year
Alternatives and similar repositories for muliticast-based-allgather
Users that are interested in muliticast-based-allgather are comparing it to the libraries listed below.
- Demystifying Datapath Accelerator Enhanced Off-path SmartNIC [ICNP24] ☆60 · Dec 5, 2024 · Updated last year
- [NSDI25] AutoCCL: Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training ☆32 · May 2, 2025 · Updated last year
- Benchmark Suite for RDMA Performance Isolation ☆42 · Sep 5, 2023 · Updated 2 years ago
- A rust-version of NVIDIA BlueField DOCA kit. ☆14 · Jun 11, 2023 · Updated 3 years ago
- A rust-based benchmark for BlueField SmartNICs. ☆30 · Jul 5, 2023 · Updated 2 years ago
- RPCNIC: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator [HPCA2025] ☆15 · Dec 9, 2024 · Updated last year
- NCCL Profiling Kit ☆155 · Jul 1, 2024 · Updated last year
- ☆18 · Dec 11, 2023 · Updated 2 years ago
- A collection of tools, code, and documentation to understand the host network on real server hardware. ☆46 · Dec 1, 2024 · Updated last year
- MSCCL++: A GPU-driven communication stack for scalable AI applications ☆532 · Updated this week
- CAM: Asynchronous GPU-Initiated, CPU-Managed SSD Management for Batching Storage Access [ICDE'25] ☆19 · Mar 3, 2025 · Updated last year
- ☆19 · Jan 24, 2019 · Updated 7 years ago
- λFS: an elastic, high-performance, serverless-function-based metadata service for large-scale distributed file systems (ACM ASPLOS'23) ☆14 · Apr 2, 2025 · Updated last year
- A curated list of awesome smartnic tutorials, papers and projects. ☆299 · Oct 27, 2025 · Updated 7 months ago
- An EDM-enabled PHY + a rack-level network simulator ☆14 · Dec 11, 2024 · Updated last year
- Demo for testing dynamic loading of the libos module. ☆10 · Nov 8, 2023 · Updated 2 years ago
- Paper list of federated learning: About system design ☆13 · Apr 13, 2022 · Updated 4 years ago
- Benchmark Test Suite for RDMA Networks ☆60 · Apr 15, 2023 · Updated 3 years ago
- An Automated Performance Optimization Framework for P4-Programmable SmartNICs ☆28 · Nov 18, 2023 · Updated 2 years ago
- ☆27 · Jun 26, 2022 · Updated 3 years ago
- Ensō is a high-performance streaming interface for NIC-application communication. ☆80 · Apr 11, 2026 · Updated 2 months ago
- The source code of INFless, a native serverless platform for AI inference. ☆10 · Oct 10, 2022 · Updated 3 years ago
- ☆71 · Feb 13, 2022 · Updated 4 years ago
- COCCL: Compression and precision co-aware collective communication library ☆33 · Mar 16, 2025 · Updated last year
- How to use node-local MPI rank IDs to manually map MPI ranks to GPUs ☆14 · Apr 22, 2020 · Updated 6 years ago
- Speed of Light Analysis for ML Model Runtime ☆77 · Jun 10, 2026 · Updated last week
- Overcoming the IOTLB Wall for Multi-100-Gbps Linux-based Networking ☆23 · May 16, 2023 · Updated 3 years ago
- DUA is a communication architecture that provides uniform access for FPGA to data center resources. Without being limited by machine bou… ☆40 · Aug 30, 2022 · Updated 3 years ago
- RPerf: Accurate Latency Measurement Framework for RDMA ☆15 · Apr 14, 2026 · Updated 2 months ago
- NetBricks: A new network function framework based on Rust. ☆12 · Jan 2, 2026 · Updated 5 months ago
- Simulator that maintains coherent caches for 4, 8 and 16 core CMP. Implementation of MSI, MESI, MOSI, MOESI and MOESIF protocols for a b… ☆11 · Jan 6, 2015 · Updated 11 years ago
- ☆54 · Jun 7, 2026 · Updated last week
- Skills for writing tilelang and debugging with CUDA toolkits. ☆126 · May 20, 2026 · Updated 3 weeks ago
- Offloading DOCA-based Adaptive Routing onto NVIDIA BlueField-2 DPU. ☆19 · Jan 8, 2024 · Updated 2 years ago
- Venus Collective Communication Library, supported by SII and Infrawaves. ☆147 · Jun 8, 2026 · Updated last week
- Using raw sockets to send IP packets ☆10 · Jan 17, 2019 · Updated 7 years ago
- Landing page for Software for Open Networking in the Cloud (SONiC) - http://azure.github.io/SONiC/ ☆13 · Updated this week
- ☆14 · May 18, 2017 · Updated 9 years ago
- A WIP project based on CAP-VM ☆18 · Nov 9, 2023 · Updated 2 years ago