Mellanox / gpu_direct_rdma_access
example code for using DC QP for providing RDMA READ and WRITE operations to remote GPU memory
☆127Updated 8 months ago
Alternatives and similar repositories for gpu_direct_rdma_access:
Users that are interested in gpu_direct_rdma_access are comparing it to the libraries listed below
- NCCL Profiling Kit☆130Updated 9 months ago
- Magnum IO community repo☆89Updated 3 months ago
- ☆176Updated 2 years ago
- TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches☆72Updated last year
- RDMA and SHARP plugins for nccl library☆189Updated 2 weeks ago
- ☆155Updated last month
- Repository for MLCommons Chakra schema and tools☆95Updated last month
- Arbitrary offloads for RDMA NICs☆89Updated 3 years ago
- GPUDirect Async support for IB Verbs☆110Updated 2 years ago
- LineFS: Efficient SmartNIC Offload of a Distributed File System with Pipeline Parallelism☆86Updated 3 years ago
- ☆14Updated 2 months ago
- ☆64Updated 3 years ago
- RDMA exmaple☆198Updated 2 years ago
- Synthesizer for optimal collective communication algorithms☆105Updated last year
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…☆94Updated 2 years ago
- ☆24Updated 2 years ago
- Hooked CUDA-related dynamic libraries by using automated code generation tools.☆152Updated last year
- ☆23Updated 2 years ago
- An interference-aware scheduler for fine-grained GPU sharing☆132Updated 2 months ago
- rdma编程学习☆24Updated 3 years ago
- ☆340Updated last year
- Mellanox libibverbs☆64Updated 5 years ago
- GPUDirect example☆59Updated 3 years ago
- Benchmark Test Suite for RDMA Networks☆53Updated 2 years ago
- Paella: Low-latency Model Serving with Virtualized GPU Scheduling☆58Updated 11 months ago
- ☆11Updated last year
- Thunder Research Group's Collective Communication Library☆36Updated last year
- Microsoft Collective Communication Library☆343Updated last year
- ☆50Updated 6 months ago
- verbs profiling library☆22Updated last year