karakozov / gpudma
GPUDirect example
☆62 · Oct 19, 2021 · Updated 4 years ago
Alternatives and similar repositories for gpudma
Users who are interested in gpudma are comparing it to the libraries listed below.
- NVIDIA GPU direct RDMA using SISCI API · ☆17 · Apr 8, 2018 · Updated 7 years ago
- Minimal HW-based demo of GPUDirect RDMA on NVIDIA Jetson AGX Xavier running L4T · ☆206 · Jul 15, 2024 · Updated last year
- GPUDirect Async support for IB Verbs · ☆135 · Nov 10, 2022 · Updated 3 years ago
- ☆384 · Apr 23, 2024 · Updated last year
- A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology · ☆1,339 · Dec 17, 2025 · Updated last month
- A user-space test platform for testing the p2pdma Linux kernel framework with NVMe CMBs and other PCIe BAR memory. · ☆58 · May 16, 2023 · Updated 2 years ago
- learning materials of driveos from nvidia drive sdk. · ☆11 · Sep 12, 2024 · Updated last year
- ☆45 · Jul 10, 2017 · Updated 8 years ago
- Magnum IO community repo · ☆109 · Dec 5, 2025 · Updated 2 months ago
- ☆217 · Nov 23, 2025 · Updated 2 months ago
- Simple message passing library · ☆30 · Aug 28, 2018 · Updated 7 years ago
- GPUDirect Async suite · ☆17 · Dec 5, 2018 · Updated 7 years ago
- Pseudo InfiniBand HCA driver (for Linux) · ☆36 · May 10, 2016 · Updated 9 years ago
- FROZEN: the master branch has merged with the libfabric git repo · ☆31 · Oct 3, 2018 · Updated 7 years ago
- RDMA core userspace libraries and daemons · ☆15 · Updated this week
- This tutorial demonstrates how to use CUDA-Aware MPI · ☆39 · May 16, 2023 · Updated 2 years ago
- Source code of "Accelerating Truss Decomposition on Heterogeneous Processors", accepted by VLDB'20 - By Yulin Che, Zhuohang Lai, Shixuan … · ☆16 · May 25, 2020 · Updated 5 years ago
- Emulating DMA Engines on GPUs for Performance and Portability · ☆41 · May 17, 2015 · Updated 10 years ago
- ☆19 · May 10, 2025 · Updated 9 months ago
- Mellanox libibverbs · ☆77 · Aug 28, 2019 · Updated 6 years ago
- Build userspace NVMe drivers and storage applications with CUDA support · ☆416 · Dec 18, 2023 · Updated 2 years ago
- OpenMP-based parallel software for computing the truss decomposition of a graph. · ☆14 · Mar 28, 2018 · Updated 7 years ago
- Pytorch process group third-party plugin for UCC · ☆21 · Apr 15, 2024 · Updated last year
- Linux extra (out of tree) kernel modules for ntrdma. · ☆21 · May 23, 2025 · Updated 8 months ago
- CUDAAdvisor: a GPU profiling tool · ☆52 · Aug 24, 2018 · Updated 7 years ago
- GPU Affinity is a package to automatically set the CPU process affinity to match the hardware architecture on a given platform · ☆29 · Dec 8, 2023 · Updated 2 years ago
- Semi-random funky stuff, mainly for my PhD experiments and articles. Contains calculations and algorithm implementations for various appl… · ☆23 · Jan 21, 2026 · Updated 3 weeks ago
- Maximal Biclique Enumeration in Bipartite Graphs · ☆21 · Mar 3, 2020 · Updated 5 years ago
- Scaling Up Subgraph Query Processing with Efficient Subgraph Matching by Shixuan Sun and Dr. Qiong Luo · ☆18 · Nov 24, 2018 · Updated 7 years ago
- Infiniband Verbs Performance Tests · ☆910 · Jan 11, 2026 · Updated last month
- Maximum clique computation over large sparse graphs · ☆23 · Mar 19, 2022 · Updated 3 years ago
- The test of different distributed-training methods on High-Flyer AIHPC · ☆27 · Oct 18, 2022 · Updated 3 years ago
- Hybrid methods for Parallel Betweenness Centrality on the GPU · ☆24 · Dec 20, 2018 · Updated 7 years ago
- A (hacky) Linux kernel driver for PCI end points that implement p2pmem on the device. · ☆24 · Mar 18, 2022 · Updated 3 years ago
- ☆24 · Jun 21, 2023 · Updated 2 years ago
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte… · ☆25 · Jun 6, 2025 · Updated 8 months ago
- Linux based user-space RSHIM driver for the Mellanox BlueField SoC · ☆35 · Updated this week
- Codes of the paper "Speeding Up Set Intersections in Graph Algorithms using SIMD Instructions" that was published in SIGMOD 2018. Authors… · ☆31 · Jan 23, 2019 · Updated 7 years ago
- Code for monograph "Cohesive Subgraph Computation over Large Sparse Graphs" · ☆26 · Apr 24, 2022 · Updated 3 years ago