GPUDirect example
☆62Oct 19, 2021Updated 4 years ago
Alternatives and similar repositories for gpudma
Users that are interested in gpudma are comparing it to the libraries listed below
Sorting:
- GPUDirect Async support for IB Verbs☆135Nov 10, 2022Updated 3 years ago
- ☆387Apr 23, 2024Updated last year
- BlueField RSHIM driver☆18Aug 27, 2020Updated 5 years ago
- A user-space test platform for testing the p2pdma Linux kernel framework with NVMe CMBs and other PCIe BAR memory.☆58May 16, 2023Updated 2 years ago
- learning materials of driveos from nvidia drive sdk.☆10Sep 12, 2024Updated last year
- ☆218Nov 23, 2025Updated 3 months ago
- Simple message passing library☆30Aug 28, 2018Updated 7 years ago
- FROZEN: the master branch has merged with the libfabric git repo☆31Oct 3, 2018Updated 7 years ago
- RDMA core userspace libraries and daemons☆15Feb 16, 2026Updated 3 weeks ago
- This tutorial demonstrates how to use CUDA-Aware MPI☆39May 16, 2023Updated 2 years ago
- Emulating DMA Engines on GPUs for Performance and Portability☆41May 17, 2015Updated 10 years ago
- ☆22Jul 19, 2022Updated 3 years ago
- OpenMP-based parallel software for computing the truss decomposition of a graph.☆14Mar 28, 2018Updated 7 years ago
- Linux extra (out of tree) kernel modules for ntrdma.☆21May 23, 2025Updated 9 months ago
- CUDAAdvisor: a GPU profiling tool☆52Aug 24, 2018Updated 7 years ago
- Semi-random funky stuff, mainly for my PhD experiments and articles. Contains calculations and algorithm implementations for various appl…☆23Jan 21, 2026Updated last month
- Scaling Up Subgraph Query Processing with Efficient Subgraph Matching by Shixuan Sun and Dr. Qiong Luo☆18Nov 24, 2018Updated 7 years ago
- Maximal Biclique Enumeration in Bipartite Graphs☆21Mar 3, 2020Updated 6 years ago
- Infiniband Verbs Performance Tests☆919Mar 2, 2026Updated last week
- RDMA and SHARP plugins for nccl library☆224Jan 12, 2026Updated last month
- The test of different distributed-training methods on High-Flyer AIHPC☆27Oct 18, 2022Updated 3 years ago
- Maximum clique computation over large sparse graphs☆23Mar 19, 2022Updated 3 years ago
- Hybrid methods for Parallel Betweenness Centrality on the GPU☆24Dec 20, 2018Updated 7 years ago
- A (hacky) Linux kernel driver for PCI end points that implement p2pmem on the device.☆24Mar 18, 2022Updated 3 years ago
- ☆23Jun 21, 2023Updated 2 years ago
- The implementation of the paper "Parallel Personalized PageRank on Dynamic Graphs"☆25Mar 1, 2018Updated 8 years ago
- RDMA core userspace libraries and daemons☆2,150Updated this week
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte…☆25Jun 6, 2025Updated 9 months ago
- ComScribe is a tool to identify communication among all GPU-GPU and CPU-GPU pairs in a single-node multi-GPU system.☆27Jul 6, 2023Updated 2 years ago
- Code for monograph "Cohesive Subgraph Computation over Large Sparse Graphs"☆26Apr 24, 2022Updated 3 years ago
- Benchmarks of different devices I have come across☆40Aug 28, 2025Updated 6 months ago
- Fast RDMA-based Ordered Key-Value Store using Remote Learned Cache☆118Jan 21, 2021Updated 5 years ago
- FpgaNIC is an FPGA-based Versatile 100Gb SmartNIC for GPUs [ATC 22]☆141Aug 17, 2023Updated 2 years ago
- MemLiner is a remote-memory-friendly runtime system.☆31Nov 1, 2022Updated 3 years ago
- Kernel Mode RDMA Ping☆28Oct 29, 2025Updated 4 months ago
- ☆25Jan 2, 2021Updated 5 years ago
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.☆122Nov 15, 2023Updated 2 years ago
- Donard: A PCIe Peer-2-Peer kernel patch and library that builds on top of NVM. Express. Also see https://github.com/sbates130272/linux-do…☆32Nov 17, 2016Updated 9 years ago
- This repository contains the ROS wrapper of scorpio's driver plus various ROS applications.☆10Jan 29, 2026Updated last month