google / nccl-plugin-gpudirecttcpx
☆13Updated last week
Alternatives and similar repositories for nccl-plugin-gpudirecttcpx
Users that are interested in nccl-plugin-gpudirecttcpx are comparing it to the libraries listed below
- Benchmark Test Suite for RDMA Networks☆59Updated 2 years ago
- ☆70Updated 3 years ago
- Benchmark Suite for RDMA Performance Isolation☆41Updated 2 years ago
- ☆46Updated 2 months ago
- Arbitrary offloads for RDMA NICs☆99Updated 3 years ago
- ☆93Updated 5 months ago
- A collection of tools, code, and documentation to understand the host network on real server hardware.☆44Updated last year
- Justitia provides RDMA isolation between applications with diverse requirements.☆43Updated 3 years ago
- Flexible, high-performance TCP offload to SmartNICs using fine-grained parallelism☆60Updated 3 years ago
- Repository for MLCommons Chakra schema and tools☆153Updated 3 months ago
- ☆50Updated last year
- ☆74Updated last month
- example code for using DC QP for providing RDMA READ and WRITE operations to remote GPU memory☆152Updated last year
- ☆44Updated last year
- Reexamining Direct Cache Access to Optimize I/O Intensive Applications for Multi-hundred-gigabit Networks☆100Updated 4 years ago
- Demystifying Datapath Accelerator Enhanced Off-path SmartNIC [ICNP24]☆55Updated last year
- A command line utility to manage the configuration of a system's high performance network interfaces for RoCE deployments☆35Updated 2 years ago
- ☆230Updated last month
- TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches☆80Updated 2 years ago
- hostCC is a congestion control architecture which handles host congestion, along with in-network congestion☆58Updated last year
- NCCL Profiling Kit☆150Updated last year
- GPUDirect Async support for IB Verbs☆135Updated 3 years ago
- tests RTT latency with HW timestamping using RDMA Write on DCT QP type☆21Updated 4 years ago
- ☆22Updated 11 months ago
- Ensō is a high-performance streaming interface for NIC-application communication.☆76Updated 4 months ago
- An Automated Performance Optimization Framework for P4-Programmable SmartNICs☆27Updated 2 years ago
- ☆38Updated 3 years ago
- [ACM CoNEXT22 Best Paper Award] NTSocks: An ultra-low latency and compatible PCIe interconnect for rack-scale disaggregation.☆41Updated last year
- ☆53Updated last month
- RDMA example☆233Updated 3 years ago