spcl / muliticast-based-allgatherView external linksLinks
☆23Feb 12, 2025Updated last year
Alternatives and similar repositories for muliticast-based-allgather
Users that are interested in muliticast-based-allgather are comparing it to the libraries listed below
Sorting:
- Demystifying Datapath Accelerator Enhanced Off-path SmartNIC [ICNP24]☆56Dec 5, 2024Updated last year
- Benchmark Suite for RDMA Performance Isolation☆41Sep 5, 2023Updated 2 years ago
- Research paper list for host networking: in a system view☆10Jan 2, 2025Updated last year
- RPCNIC: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator [HPCA2025]☆13Dec 9, 2024Updated last year
- CAM: Asynchronous GPU-Initiated, CPU-Managed SSD Management for Batching Storage Access [ICDE'25]☆18Mar 3, 2025Updated 11 months ago
- A rust-based benchmark for BlueField SmartNICs.☆30Jul 5, 2023Updated 2 years ago
- ☆18Dec 11, 2023Updated 2 years ago
- [NSDI25] AutoCCL: Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training☆30May 2, 2025Updated 9 months ago
- A curated list of awesome smartnic tutorials, papers and projects.☆291Oct 27, 2025Updated 3 months ago
- NCCL Profiling Kit☆152Jul 1, 2024Updated last year
- MSCCL++: A GPU-driven communication stack for scalable AI applications☆464Updated this week
- Benchmark Test Suite for RDMA Networks☆59Apr 15, 2023Updated 2 years ago
- An Automated Performance Optimization Framework for P4-Programmable SmartNICs☆28Nov 18, 2023Updated 2 years ago
- Venus Collective Communication Library, supported by SII and Infrawaves.☆138Updated this week
- ☆46Nov 24, 2025Updated 2 months ago
- ☆70Feb 13, 2022Updated 4 years ago
- Ensō is a high-performance streaming interface for NIC-application communication.☆76Sep 4, 2025Updated 5 months ago
- DUA, is a communication architecture that provides uniform access for FPGA to data center resources. Without being limited by machine bou…☆40Aug 30, 2022Updated 3 years ago
- [ACM CoNEXT22 Best Paper Award] NTSocks: An ultra-low latency and compatible PCIe interconnect for rack-scale disaggregation.☆41Jul 11, 2024Updated last year
- Reexamining Direct Cache Access to Optimize I/O Intensive Applications for Multi-hundred-gigabit Networks☆101Sep 2, 2021Updated 4 years ago
- An EDM-enabled PHY + a rack-level network simulator☆13Dec 11, 2024Updated last year
- Latr: Lazy Translation Coherence - ASPLOS'18☆16Nov 15, 2021Updated 4 years ago
- λFS: an elastic, high-performance, serverless-function-based metadata service for large-scale distributed file systems (ACM ASPLOS'23)☆14Apr 2, 2025Updated 10 months ago
- ☆11Aug 4, 2022Updated 3 years ago
- [PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and min…☆10Aug 13, 2024Updated last year
- FeRTOS is a simple "operating system" that currently supports ARM Cortex-M CPUs☆13Jul 9, 2022Updated 3 years ago
- ☆13Apr 1, 2017Updated 8 years ago
- Multi-platform topology-aware memory management library☆13Apr 23, 2020Updated 5 years ago
- A gitbook named studying-containerd-notes☆10Dec 17, 2018Updated 7 years ago
- Prototype of the system described in "Trace Types and Denotational Semantics for Sound Programmable Inference in Probabilistic Languages"☆11Aug 8, 2025Updated 6 months ago
- NetBricks: A new network function framework based on Rust.☆12Jan 2, 2026Updated last month
- Accepted to MLSys 2026☆70Jan 29, 2026Updated 2 weeks ago
- Resources on the Artifact Evaluation (AE) Process☆17Jan 29, 2021Updated 5 years ago
- Development work for qemu☆10Aug 22, 2016Updated 9 years ago
- ☆11May 30, 2023Updated 2 years ago
- ☆14Jun 17, 2024Updated last year
- Next-generation datacenter OS built on kernel bypass to speed up unmodified code while improving platform density and security☆119Nov 14, 2025Updated 3 months ago
- A new DRAM substrate that mitigates the excessive energy consumption from both (i) transmitting unused data on the memory channel and (i…☆13Aug 23, 2024Updated last year
- ☆10Apr 29, 2023Updated 2 years ago