NVIDIA/gdrcopy

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

☆1,399

Alternatives and similar repositories for gdrcopy

Users interested in gdrcopy are comparing it to the libraries listed below.

Mellanox / nv_peer_memory
☆399 · Apr 23, 2024 · Updated 2 years ago
openucx / ucx
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
☆1,673 · Updated this week
NVIDIA / nccl
Optimized primitives for collective multi-GPU communication
☆4,892 · Updated this week
Mellanox / nccl-rdma-sharp-plugins
RDMA and SHARP plugins for nccl library
☆233 · Apr 3, 2026 · Updated 3 months ago
NVIDIA / multi-gpu-programming-models
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
☆908 · Sep 26, 2025 · Updated 9 months ago
linux-rdma / perftest
Infiniband Verbs Performance Tests
☆998 · Jul 12, 2026 · Updated last week
gpudirect / libgdsync
GPUDirect Async support for IB Verbs
☆139 · Nov 10, 2022 · Updated 3 years ago
Mellanox / gpu_direct_rdma_access
example code for using DC QP for providing RDMA READ and WRITE operations to remote GPU memory
☆157 · Jul 30, 2024 · Updated last year
NVIDIA / gds-nvidia-fs
NVIDIA GPUDirect Storage Driver
☆367 · Jun 1, 2026 · Updated last month
NVIDIA / nvbandwidth
A tool for bandwidth measurements on NVIDIA GPUs.
☆732 · Apr 8, 2026 · Updated 3 months ago
NVIDIA / nccl-tests
NCCL Tests
☆1,595 · Jul 9, 2026 · Updated last week
linux-rdma / rdma-core
RDMA core userspace libraries and daemons
☆2,311 · Jul 8, 2026 · Updated last week
NVIDIA / nvbench
CUDA Kernel Benchmarking Library
☆900 · Updated this week
openucx / ucc
Unified Collective Communication Library
☆310 · Updated this week
karakozov / gpudma
GPUDirect example
☆64 · Oct 19, 2021 · Updated 4 years ago
ai-dynamo / nixl
NVIDIA Inference Xfer Library (NIXL)
☆1,138 · Updated this week
enfiskutensykkel / ssd-gpu-dma
Build userspace NVMe drivers and storage applications with CUDA support
☆441 · Dec 18, 2023 · Updated 2 years ago
NVIDIA / jitify
A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).
☆573 · Sep 15, 2025 · Updated 10 months ago
NVIDIA / nvshmem
NVIDIA NVSHMEM is a parallel programming interface for NVIDIA GPUs based on OpenSHMEM. NVSHMEM can significantly reduce multi-process com…
☆560 · Updated this week
microsoft / NPKit
NCCL Profiling Kit
☆155 · Jul 1, 2024 · Updated 2 years ago
jcxue / RDMA-Tutorial
A tutorial on RDMA based programming using code examples
☆635 · Jan 3, 2020 · Updated 6 years ago
NVIDIA / cutlass
CUDA Templates and Python DSLs for High-Performance Linear Algebra
☆10,104 · Updated this week
microsoft / mscclpp
MSCCL++: A GPU-driven communication stack for scalable AI applications
☆541 · Updated this week
bytedance / flux
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
☆1,343 · Aug 28, 2025 · Updated 10 months ago
pytorch / gloo
Collective communications library with various primitives for multi-machine training.
☆1,437 · Jul 1, 2026 · Updated 2 weeks ago
microsoft / msccl
Microsoft Collective Communication Library
☆394 · Sep 20, 2023 · Updated 2 years ago
NVIDIA / cub
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
☆1,840 · Oct 9, 2023 · Updated 2 years ago
NVIDIA / nvcomp
Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloade…
☆627 · Jul 13, 2026 · Updated last week
efficient / rdma_bench
A framework to understand RDMA
☆412 · Oct 12, 2023 · Updated 2 years ago
uccl-project / uccl
UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g…
☆1,465 · Updated this week
aws / aws-ofi-nccl
This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.
☆228 · Updated this week
ByteDance-Seed / Triton-distributed
Distributed Compiler based on Triton for Parallel Systems
☆1,493 · Jul 11, 2026 · Updated last week
open-mpi / ompi
Open MPI main development repository
☆2,618 · Updated this week
antgroup / glake
GLake: optimizing GPU memory management and IO transmission.
☆501 · Mar 24, 2025 · Updated last year
NVIDIA / NVTX
The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…
☆544 · Updated this week
NVIDIA / cuCollections
☆654 · Updated this week
ofiwg / libfabric
Open Fabric Interfaces
☆816 · Updated this week
NVIDIA / cnmem
A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory
☆298 · Nov 28, 2018 · Updated 7 years ago
NVIDIA / cccl
CUDA Core Compute Libraries
☆2,431 · Updated this week