Mellanox/gpu_direct_rdma_access

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Mellanox/gpu_direct_rdma_access)

Mellanox / gpu_direct_rdma_access

example code for using DC QP for providing RDMA READ and WRITE operations to remote GPU memory

☆158

Alternatives and similar repositories for gpu_direct_rdma_access

Users that are interested in gpu_direct_rdma_access are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Mellanox / nv_peer_memory
View on GitHub
☆399Apr 23, 2024Updated 2 years ago
NVIDIA / gdrcopy
View on GitHub
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
☆1,401Jul 14, 2026Updated last week
linux-rdma / perftest
View on GitHub
Infiniband Verbs Performance Tests
☆999Jul 12, 2026Updated 2 weeks ago
jcxue / RDMA-Tutorial
View on GitHub
A tutorial on RDMA based programming using code examples
☆637Jan 3, 2020Updated 6 years ago
DolphinICS / cuda-rdma-bench
View on GitHub
NVIDIA GPU direct RDMA using SISCI API
☆18Apr 8, 2018Updated 8 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
karakozov / gpudma
View on GitHub
GPUDirect example
☆64Oct 19, 2021Updated 4 years ago
Mellanox / nccl-rdma-sharp-plugins
View on GitHub
RDMA and SHARP plugins for nccl library
☆233Apr 3, 2026Updated 3 months ago
ZaidQureshi / bam
View on GitHub
☆235Mar 28, 2026Updated 3 months ago
redn-io / RedN
View on GitHub
Arbitrary offloads for RDMA NICs
☆100Apr 25, 2022Updated 4 years ago
gpudirect / gdasync
View on GitHub
GPUDirect Async suite
☆16Dec 5, 2018Updated 7 years ago
NVIDIA / gds-nvidia-fs
View on GitHub
NVIDIA GPUDirect Storage Driver
☆367Jun 1, 2026Updated last month
linux-rdma / rdma-core
View on GitHub
RDMA core userspace libraries and daemons
☆2,318Updated this week
rs3lab / SynCord
View on GitHub
https://rs3lab.github.io/SynCord/
☆26Nov 23, 2022Updated 3 years ago
microsoft / NPKit
View on GitHub
NCCL Profiling Kit
☆155Jul 1, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
KuangjuX / AttnLink
View on GitHub
An experimental communicating attention kernel based on DeepEP.
☆34Jul 29, 2025Updated 11 months ago
openucx / ucx
View on GitHub
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
☆1,679Updated this week
howardlau1999 / ucxpp
View on GitHub
☆21Jul 13, 2026Updated last week
claudebarthels / infinity
View on GitHub
A lightweight C++ RDMA library for InfiniBand networks.
☆211May 12, 2022Updated 4 years ago
gpudirect / libgdsync
View on GitHub
GPUDirect Async support for IB Verbs
☆139Nov 10, 2022Updated 3 years ago
dmemsys / FUSEE
View on GitHub
This is the implementation repository of our FAST'23 paper: FUSEE: A Fully Memory-Disaggregated Key-Value Store.
☆62Feb 14, 2023Updated 3 years ago
rs3lab / TCLocks
View on GitHub
Repo for OSDI 2023 paper: "Ship your Critical Section Not Your Data: Enabling Transparent Delegation with TCLocks"
☆21Nov 6, 2024Updated last year
zzh-wisdom / RDMA-Programming
View on GitHub
rdma编程学习
☆25Dec 6, 2021Updated 4 years ago
enfiskutensykkel / ssd-gpu-dma
View on GitHub
Build userspace NVMe drivers and storage applications with CUDA support
☆442Dec 18, 2023Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
NVIDIA / jetson-rdma-picoevb
View on GitHub
Minimal HW-based demo of GPUDirect RDMA on NVIDIA Jetson AGX Xavier running L4T
☆223Jul 15, 2024Updated 2 years ago
SJTU-IPADS / PhoenixOS
View on GitHub
Fast OS-level support for GPU checkpoint and restore
☆285Sep 28, 2025Updated 9 months ago
openucx / ucc
View on GitHub
Unified Collective Communication Library
☆311Jul 17, 2026Updated last week
dmemsys / awesome-disaggregated-memory
View on GitHub
A collection of awesome researchers and papers about disaggregated memory.
☆192Jul 15, 2026Updated last week
YJMSTR / flash-linear-attention
View on GitHub
FLA but cuTile
☆27Apr 17, 2026Updated 3 months ago
casys-kaist / LineFS
View on GitHub
LineFS: Efficient SmartNIC Offload of a Distributed File System with Pipeline Parallelism
☆90Dec 24, 2021Updated 4 years ago
Mellanox / rdma_fc
View on GitHub
Demonstration of flow control over RDMA fabric
☆13Jun 28, 2018Updated 8 years ago
HanGuo97 / hilt
View on GitHub
☆40Dec 14, 2025Updated 7 months ago
uclasystem / canvas
View on GitHub
Canvas: Isolated and Adaptive Swapping for Multi-Applications on Remote Memory
☆38Apr 19, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
PASAUCMerced / Sentinel
View on GitHub
Efficient-Tensor-Management-on-HM-for-Deep-Learning
☆11Nov 15, 2021Updated 4 years ago
coldfunction / qCUDA
View on GitHub
qCUDA: GPGPU Virtualization at a New API Remoting Method with Para-virtualization
☆136Feb 9, 2022Updated 4 years ago
NVIDIA / MagnumIO
View on GitHub
Magnum IO community repo
☆117Jun 22, 2026Updated last month
cherichy / tilecute
View on GitHub
☆32Jul 2, 2025Updated last year
google / nccl-fastsocket
View on GitHub
NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
☆125Nov 15, 2023Updated 2 years ago
uclasystem / hermit
View on GitHub
Hermit: Low-Latency, High-Throughput, and Transparent Remote Memory via Feedback-Directed Asynchrony
☆35May 29, 2024Updated 2 years ago
SJTU-IPADS / PhoenixOS-Remoting
View on GitHub
☆21Jul 10, 2025Updated last year