Dolphinics / cuda-rdma-benchLinks
NVIDIA GPU direct RDMA using SISCI API
☆17Updated 7 years ago
Alternatives and similar repositories for cuda-rdma-bench
Users that are interested in cuda-rdma-bench are comparing it to the libraries listed below
Sorting:
- GPUDirect example☆60Updated 3 years ago
- Blaze runtime system that support efficient accelerator integration for big data.☆24Updated 8 years ago
- ☆68Updated 8 years ago
- A kernel module to enable RDMA transfers to/from IO, PFN and DAX mapped memory☆10Updated 10 years ago
- GPUDirect Async support for IB Verbs☆130Updated 2 years ago
- verbs profiling library☆23Updated last year
- Infiniband verbs performance tests (fork of git://git.openfabrics.org/~grockah/perftest.git)☆20Updated 9 years ago
- Magnum IO community repo☆98Updated 3 weeks ago
- Heterogeneous Memory Software Development Kit☆83Updated 8 months ago
- A user-space test platform for testing the p2pdma Linux kernel framework with NVMe CMBs and other PCIe BAR memory.☆56Updated 2 years ago
- A framework for pipelined computing on GPU☆29Updated 6 years ago
- Persistent Collectives X- A collective communication library for high performance, low cost persistent collectives over RDMA devices.☆14Updated 6 years ago
- LITE Kernel RDMA Support for Datacenter Applications. SOSP 2017.☆111Updated 5 years ago
- Graph500 reference implementations☆179Updated 3 years ago
- A user level library for applications to transparently use Intel DSA.☆38Updated last week
- ☆20Updated 8 years ago
- A new memory mapping interface for efficient direct user-space access to byte-addressable storage, published in MICRO2022.☆15Updated 2 years ago
- ☆73Updated 9 years ago
- OpenCSD: eBPF Computational Storage Device (CSD) for Zoned Namespace (ZNS) SSDs in QEMU☆62Updated last year
- Dynamic and Transparent Memory Sharing for Accelerating Big Data Analytics Workloads in Virtualized Cloud☆16Updated 8 years ago
- GPUnet is a native GPU networking layer that provides a socket abstraction over Infiniband to GPU programs for NVIDIA GPUs.☆117Updated 10 years ago
- Prefetching and efficient data path for memory disaggregation☆68Updated 5 years ago
- Tensorflow is a computational library using data flow graphs for scalable machine learning, and Tensorflow-RDMA is the implementation ov…☆58Updated 2 years ago
- Source code for our OSDI 2016 paper☆110Updated 6 years ago
- ☆45Updated 8 years ago
- Cluster Far Mem, framework to execute single job and multi job experiments using fastswap☆21Updated last year
- ☆24Updated 3 months ago
- LineFS: Efficient SmartNIC Offload of a Distributed File System with Pipeline Parallelism☆86Updated 3 years ago
- ☆31Updated 4 years ago
- A Memory-Disaggregated Managed Runtime.☆66Updated 4 years ago