google / nccl-plugin-gpudirecttcpxLinks
☆15Updated 3 weeks ago
Alternatives and similar repositories for nccl-plugin-gpudirecttcpx
Users that are interested in nccl-plugin-gpudirecttcpx are comparing it to the libraries listed below
Sorting:
- ☆46Updated 2 months ago
- example code for using DC QP for providing RDMA READ and WRITE operations to remote GPU memory☆152Updated last year
- A collection of tools, code, and documentation to understand the host network on real server hardware.☆44Updated last year
- Arbitrary offloads for RDMA NICs☆99Updated 3 years ago
- Benchmark Test Suite for RDMA Networks☆59Updated 2 years ago
- ☆44Updated last year
- ☆70Updated 3 years ago
- ☆11Updated 3 weeks ago
- Justitia provides RDMA isolation between applications with diverse requirements.☆43Updated 3 years ago
- Demystifying Datapath Accelerator Enhanced Off-path SmartNIC [ICNP24]☆56Updated last year
- ☆22Updated last year
- NCCL Profiling Kit☆152Updated last year
- TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches☆80Updated 2 years ago
- ☆93Updated 5 months ago
- Ensō is a high-performance streaming interface for NIC-application communication.☆76Updated 5 months ago
- Flexible, high-performance TCP offload to SmartNICs using fine-grained parallelism☆60Updated 3 years ago
- Benchmark Suite for RDMA Performance Isolation☆41Updated 2 years ago
- LineFS: Efficient SmartNIC Offload of a Distributed File System with Pipeline Parallelism☆89Updated 4 years ago
- Efficient GPU communication over multiple NICs.☆22Updated 2 months ago
- hostCC is a congestion control architecture which handles host congestion, along with in-network congestion☆58Updated last year
- ☆50Updated last year
- ☆231Updated last month
- Reexamining Direct Cache Access to Optimize I/O Intensive Applications for Multi-hundred-gigabit Networks☆100Updated 4 years ago
- ☆77Updated last month
- tests RTT latency with HW timestamping using RDMA Write on DCT QP type☆21Updated 5 years ago
- GPUDirect Async support for IB Verbs☆135Updated 3 years ago
- A command line utility to manage the configuration of a system's high performance network interfaces for RoCE deployments☆35Updated 2 years ago
- [ACM CoNEXT22 Best Paper Award] NTSocks: An ultra-low latency and compatible PCIe interconnect for rack-scale disaggregation.☆41Updated last year
- rFaaS: a high-performance FaaS platform with RDMA acceleration for low-latency invocations.☆58Updated 7 months ago
- Repository for MLCommons Chakra schema and tools☆39Updated 2 years ago