DolphinICS / cuda-rdma-benchLinks
NVIDIA GPU direct RDMA using SISCI API
☆17Updated 7 years ago
Alternatives and similar repositories for cuda-rdma-bench
Users that are interested in cuda-rdma-bench are comparing it to the libraries listed below
Sorting:
- GPUDirect example☆60Updated 4 years ago
- A user level library for applications to transparently use Intel DSA.☆40Updated last month
- A kernel module to enable RDMA transfers to/from IO, PFN and DAX mapped memory☆10Updated 10 years ago
- The classic STREAM benchmark, extended to measure NUMA effects.☆38Updated 6 years ago
- A new memory mapping interface for efficient direct user-space access to byte-addressable storage, published in MICRO2022.☆15Updated 3 years ago
- Dynamic and Transparent Memory Sharing for Accelerating Big Data Analytics Workloads in Virtualized Cloud☆16Updated 8 years ago
- Heterogeneous Memory Software Development Kit☆90Updated 2 months ago
- ☆20Updated 8 years ago
- Magnum IO community repo☆106Updated last month
- Persistent Collectives X- A collective communication library for high performance, low cost persistent collectives over RDMA devices.☆14Updated 6 years ago
- verbs profiling library☆22Updated 2 years ago
- ☆68Updated 8 years ago
- Tensorflow is a computational library using data flow graphs for scalable machine learning, and Tensorflow-RDMA is the implementation ov…☆59Updated 3 years ago
- A framework for pipelined computing on GPU☆30Updated 6 years ago
- Blaze runtime system that support efficient accelerator integration for big data.☆24Updated 8 years ago
- A user-space test platform for testing the p2pdma Linux kernel framework with NVMe CMBs and other PCIe BAR memory.☆57Updated 2 years ago
- ☆46Updated 8 years ago
- ☆76Updated 9 years ago
- A Memory-Disaggregated Managed Runtime.☆67Updated 4 years ago
- GPUDirect Async support for IB Verbs☆134Updated 3 years ago
- Arbitrary offloads for RDMA NICs☆99Updated 3 years ago
- Mellanox libibverbs☆77Updated 6 years ago
- ☆31Updated 4 years ago
- An FPGA-based full-stack in-storage computing system.☆38Updated 5 years ago
- Infiniband verbs performance tests (fork of git://git.openfabrics.org/~grockah/perftest.git)☆20Updated 9 years ago
- Graph500 reference implementations☆181Updated 3 years ago
- ☆21Updated 4 years ago
- [USENIX ATC 2021] Exploring the Design Space of Page Management for Multi-Tiered Memory Systems☆48Updated 3 years ago
- A disaggregated memory orchestration system that virtualizes cluster wide memory to scale data intensive, large memory workloads in virtu…☆13Updated 6 years ago
- Scaling Up Memory Disaggregated Applications with SMART☆32Updated last year