CGCL-codes / Tensorflow-RDMA
Tensorflow is a computational library using data flow graphs for scalable machine learning, and Tensorflow-RDMA is the implementation over RDMA, which can get about 4.5x speedup on two nodes comparing with TCP/IP.
☆58Updated 2 years ago
Alternatives and similar repositories for Tensorflow-RDMA:
Users that are interested in Tensorflow-RDMA are comparing it to the libraries listed below
- ☆67Updated 7 years ago
- verbs profiling library☆22Updated last year
- Fine-grained GPU sharing primitives☆141Updated 5 years ago
- 2RDMA_Aware_Programming_user_manual 一书的中文翻译☆78Updated 11 years ago
- Fast In-memory Transaction Processing using Hybrid RDMA Primitives☆66Updated 6 years ago
- LITE Kernel RDMA Support for Datacenter Applications. SOSP 2017.☆107Updated 4 years ago
- GPU-specialized parameter server for GPU machine learning.☆100Updated 6 years ago
- Source code for our OSDI 2016 paper☆110Updated 6 years ago
- ☆22Updated 5 years ago
- A NUMA-aware Graph-structured Analytics Framework☆42Updated 6 years ago
- Writing RDMA applications on Linux Example programs☆45Updated last year
- RDMA Optimization on MXNet☆14Updated 7 years ago
- this is the release repository of superneurons☆52Updated 4 years ago
- ☆21Updated 2 years ago
- ☆72Updated 8 years ago
- Arbitrary offloads for RDMA NICs☆88Updated 2 years ago
- Enhanced networking support for TensorFlow. Maintained by SIG-networking.☆98Updated 3 years ago
- A framework for pipelined computing on GPU☆29Updated 5 years ago
- A Simple RDMA Wheel☆21Updated 5 years ago
- BytePS examples (Vision, NLP, GAN, etc)☆19Updated 2 years ago
- A kernel module to enable RDMA transfers to/from IO, PFN and DAX mapped memory☆10Updated 9 years ago
- Infiniband verbs performance tests (fork of git://git.openfabrics.org/~grockah/perftest.git)☆17Updated 9 years ago
- ☆20Updated 7 years ago
- ☆33Updated 6 years ago
- An RDMA-enabled Distributed Persistent Memory File System☆156Updated 7 years ago
- A lightweight parameter server interface☆75Updated 2 years ago
- Documentation for StreamExecutor open source proposal☆83Updated 8 years ago
- High Performance Network Library for RDMA☆27Updated 2 years ago
- Prefetching and efficient data path for memory disaggregation☆67Updated 4 years ago
- Analyze network performance in distributed training☆18Updated 4 years ago