☆23Feb 12, 2025Updated last year
Alternatives and similar repositories for muliticast-based-allgather
Users that are interested in muliticast-based-allgather are comparing it to the libraries listed below
Sorting:
- Demystifying Datapath Accelerator Enhanced Off-path SmartNIC [ICNP24]☆56Dec 5, 2024Updated last year
- Benchmark Suite for RDMA Performance Isolation☆41Sep 5, 2023Updated 2 years ago
- Research paper list for host networking: in a system view☆10Jan 2, 2025Updated last year
- RPCNIC: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator [HPCA2025]☆14Dec 9, 2024Updated last year
- A rust-version of NVIDIA BlueField DOCA kit.☆14Jun 11, 2023Updated 2 years ago
- CAM: Asynchronous GPU-Initiated, CPU-Managed SSD Management for Batching Storage Access [ICDE'25]☆18Mar 3, 2025Updated last year
- A rust-based benchmark for BlueField SmartNICs.☆30Jul 5, 2023Updated 2 years ago
- ☆18Dec 11, 2023Updated 2 years ago
- ☆18Jan 24, 2019Updated 7 years ago
- A collection of tools, code, and documentation to understand the host network on real server hardware.☆44Dec 1, 2024Updated last year
- [NSDI25] AutoCCL: Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training☆31May 2, 2025Updated 10 months ago
- Overcoming the IOTLB Wall for Multi-100-Gbps Linux-based Networking☆24May 16, 2023Updated 2 years ago
- NCCL Profiling Kit☆152Jul 1, 2024Updated last year
- MSCCL++: A GPU-driven communication stack for scalable AI applications☆476Updated this week
- Benchmark Test Suite for RDMA Networks☆59Apr 15, 2023Updated 2 years ago
- ☆26Jun 26, 2022Updated 3 years ago
- An Automated Performance Optimization Framework for P4-Programmable SmartNICs☆28Nov 18, 2023Updated 2 years ago
- Venus Collective Communication Library, supported by SII and Infrawaves.☆138Updated this week
- Ensō is a high-performance streaming interface for NIC-application communication.☆78Sep 4, 2025Updated 6 months ago
- ☆70Feb 13, 2022Updated 4 years ago
- DUA, is a communication architecture that provides uniform access for FPGA to data center resources. Without being limited by machine bou…☆40Aug 30, 2022Updated 3 years ago
- [ACM CoNEXT22 Best Paper Award] NTSocks: An ultra-low latency and compatible PCIe interconnect for rack-scale disaggregation.☆41Jul 11, 2024Updated last year
- Reexamining Direct Cache Access to Optimize I/O Intensive Applications for Multi-hundred-gigabit Networks☆101Sep 2, 2021Updated 4 years ago
- How to use node-local MPI rank IDs to manually map MPI ranks to GPUs☆14Apr 22, 2020Updated 5 years ago
- [PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and min…☆10Aug 13, 2024Updated last year
- Latr: Lazy Translation Coherence - ASPLOS'18☆16Nov 15, 2021Updated 4 years ago
- ☆11Aug 4, 2022Updated 3 years ago
- FeRTOS is a simple "operating system" that currently supports ARM Cortex-M CPUs☆13Jul 9, 2022Updated 3 years ago
- ☆14Jun 17, 2024Updated last year
- Multi-platform topology-aware memory management library☆13Apr 23, 2020Updated 5 years ago
- Prototype of the system described in "Trace Types and Denotational Semantics for Sound Programmable Inference in Probabilistic Languages"☆11Aug 8, 2025Updated 7 months ago
- 一个用于管理多个 Claude API 配置的命令行工具。可以轻松在不同环境或账户的 API 密钥和基础 URL 之间切换。☆23Aug 7, 2025Updated 7 months ago
- KNN算法基于Hadoop平台的MapReduce实现☆12Jun 28, 2020Updated 5 years ago
- Resources on the Artifact Evaluation (AE) Process☆17Jan 29, 2021Updated 5 years ago
- NetBricks: A new network function framework based on Rust.☆12Jan 2, 2026Updated 2 months ago
- Development work for qemu☆10Aug 22, 2016Updated 9 years ago
- Accepted to MLSys 2026☆70Mar 2, 2026Updated last week
- ☆13Apr 1, 2017Updated 8 years ago
- A gitbook named studying-containerd-notes☆10Dec 17, 2018Updated 7 years ago