Dolphinics / cuda-rdma-bench
NVIDIA GPU direct RDMA using SISCI API
☆17Updated 7 years ago
Alternatives and similar repositories for cuda-rdma-bench
Users who are interested in cuda-rdma-bench are comparing it to the repositories listed below
- GPUDirect example☆60Updated 4 years ago
- The classic STREAM benchmark, extended to measure NUMA effects.☆38Updated 6 years ago
- A kernel module to enable RDMA transfers to/from IO, PFN and DAX mapped memory☆10Updated 10 years ago
- Persistent Collectives X: a collective communication library for high-performance, low-cost persistent collectives over RDMA devices.☆14Updated 6 years ago
- GPUDirect Async support for IB Verbs☆133Updated 3 years ago
- A framework for pipelined computing on GPU☆30Updated 6 years ago
- Magnum IO community repo☆105Updated 2 weeks ago
- Dynamic and Transparent Memory Sharing for Accelerating Big Data Analytics Workloads in Virtualized Cloud☆16Updated 8 years ago
- Graph500 reference implementations☆181Updated 3 years ago
- ☆75Updated 9 years ago
- [USENIX ATC 2021] Exploring the Design Space of Page Management for Multi-Tiered Memory Systems☆48Updated 3 years ago
- verbs profiling library☆22Updated 2 years ago
- Tensorflow is a computational library using data flow graphs for scalable machine learning, and Tensorflow-RDMA is the implementation ov…☆59Updated 3 years ago
- Heterogeneous Memory Software Development Kit☆91Updated last month
- ☆31Updated 4 years ago
- Arbitrary offloads for RDMA NICs☆99Updated 3 years ago
- ☆20Updated 8 years ago
- ☆68Updated 8 years ago
- Pointer-chasing memory benchmark (forked from Doug Pase's code).☆59Updated 12 years ago
- Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPUs☆91Updated last year
- Kernel repo of "Nimble Page Management for Tiered Memory Systems" in ASPLOS 2019☆46Updated 3 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆30Updated last year
- µSuite: A Benchmark Suite for Microservices☆44Updated 5 years ago
- A platform to evaluate techniques used in multicore graph processing.☆37Updated 7 years ago
- A user level library for applications to transparently use Intel DSA.☆39Updated last month
- Chinese translation of the book 2RDMA_Aware_Programming_user_manual☆79Updated 12 years ago
- Asynchronous Multi-GPU Programming Framework☆48Updated 4 years ago
- Mellanox libibverbs☆76Updated 6 years ago
- TLB Benchmarks☆35Updated 8 years ago
- ☆27Updated 6 months ago