CGCL-codes / Tensorflow-RDMA
TensorFlow is a computational library that uses data flow graphs for scalable machine learning. Tensorflow-RDMA is an implementation of it over RDMA, which achieves about a 4.5x speedup on two nodes compared with TCP/IP.
☆58 Updated 2 years ago
Alternatives and similar repositories for Tensorflow-RDMA:
Users that are interested in Tensorflow-RDMA are comparing it to the libraries listed below
- Source code for our OSDI 2016 paper ☆110 Updated 6 years ago
- ☆67 Updated 7 years ago
- LITE Kernel RDMA Support for Datacenter Applications. SOSP 2017. ☆107 Updated 4 years ago
- Fast In-memory Transaction Processing using Hybrid RDMA Primitives ☆66 Updated 6 years ago
- verbs profiling library ☆22 Updated last year
- ☆72 Updated 8 years ago
- Cocytus is an efficient and available in-memory K/V-store through hybrid erasure coding and replication ☆30 Updated 9 years ago
- Fine-grained GPU sharing primitives ☆141 Updated 5 years ago
- ☆33 Updated 6 years ago
- Writing RDMA applications on Linux Example programs ☆45 Updated last year
- A Simple RDMA Wheel ☆21 Updated 6 years ago
- A NUMA-aware Graph-structured Analytics Framework ☆42 Updated 6 years ago
- RDMA Optimization on MXNet ☆14 Updated 7 years ago
- Chinese translation of the book 2RDMA_Aware_Programming_user_manual ☆78 Updated 11 years ago
- Arbitrary offloads for RDMA NICs ☆89 Updated 2 years ago
- Enhanced networking support for TensorFlow. Maintained by SIG-networking. ☆98 Updated 3 years ago
- GPU-specialized parameter server for GPU machine learning. ☆101 Updated 6 years ago
- A lightweight parameter server interface ☆76 Updated 2 years ago
- ☆82 Updated 2 years ago
- HME: a hybrid memory emulator for studying the performance and energy characteristics of upcoming NVM technologies. HME exploits features … ☆49 Updated 2 years ago
- An RDMA-enabled Distributed Persistent Memory File System ☆156 Updated 7 years ago
- [FAST 2022] FORD: Fast One-sided RDMA-based Distributed Transactions for Disaggregated Persistent Memory ☆60 Updated 9 months ago
- ☆20 Updated 7 years ago
- Fast RDMA-based Ordered Key-Value Store using Remote Learned Cache ☆115 Updated 4 years ago
- Fast In-memory Transaction Processing using RDMA and HTM ☆57 Updated 9 years ago
- ☆21 Updated 2 years ago
- RLib is a header-only library for easier usage of RDMA. ☆45 Updated 3 years ago
- ☆7 Updated 7 years ago
- Tiresias is a GPU cluster manager for distributed deep learning training. ☆152 Updated 4 years ago
- Prefetching and efficient data path for memory disaggregation ☆67 Updated 4 years ago