TensorFlow is a computational library that uses data flow graphs for scalable machine learning. Tensorflow-RDMA is an implementation over RDMA that achieves about a 4.5x speedup on two nodes compared with TCP/IP.
☆59 · Nov 27, 2022 · Updated 3 years ago
Alternatives and similar repositories for Tensorflow-RDMA
Users that are interested in Tensorflow-RDMA are comparing it to the libraries listed below.
- LITE Kernel RDMA Support for Datacenter Applications. SOSP 2017. ☆112 · Jul 9, 2020 · Updated 5 years ago
- Distributed Shared Persistent Memory. SoCC 2017 ☆70 · Jul 20, 2020 · Updated 5 years ago
- An RDMA-enabled Distributed Persistent Memory File System ☆162 · Oct 14, 2017 · Updated 8 years ago
- Frog is Asynchronous Graph Processing on GPU with Hybrid Coloring Model. The fundamental idea is based on Pareto principle (or 80-20 rule… ☆36 · May 29, 2021 · Updated 5 years ago
- [Archived] A Fast Multi-tiered Distributed Storage System based on User-Level I/O ☆75 · Mar 2, 2018 · Updated 8 years ago
- Falcon: Fast OLTP Engine for Persistent Cache and Non-Volatile Memory ☆11 · Nov 1, 2023 · Updated 2 years ago
- Source code for our OSDI 2016 paper ☆110 · Nov 11, 2018 · Updated 7 years ago
- [FAST 2022] FORD: Fast One-sided RDMA-based Distributed Transactions for Disaggregated Persistent Memory ☆62 · Jun 22, 2024 · Updated 2 years ago
- A tutorial on RDMA based programming using code examples ☆632 · Jan 3, 2020 · Updated 6 years ago
- ☆21 · Nov 29, 2022 · Updated 3 years ago
- Enterprise: Breadth-First Graph Traversal on GPUs. SC'15. ☆33 · May 20, 2017 · Updated 9 years ago
- This is the Group-Meeting collections of HKUST System NetworkING (SING) Research Group. ☆27 · Oct 3, 2019 · Updated 6 years ago
- This is the implementation repository of our SOSP'24 paper: Aceso: Achieving Efficient Fault Tolerance in Memory-Disaggregated Key-Value … ☆24 · Oct 20, 2024 · Updated last year
- Reading seminar in Harvard Cloud Networking and Systems Group ☆16 · Aug 29, 2022 · Updated 3 years ago
- A P4 implementation of a 5G UPF for BMv2 ☆15 · Oct 11, 2021 · Updated 4 years ago
- A framework to understand RDMA ☆413 · Oct 12, 2023 · Updated 2 years ago
- GPU-accelerated LLM Training Simulator ☆22 · Jun 26, 2025 · Updated last year
- Mirror of Apache crail (Incubating) ☆152 · Jul 3, 2022 · Updated 3 years ago
- ☆33 · Mar 31, 2021 · Updated 5 years ago
- Rcmp: Reconstructing RDMA-based Memory Disaggregation via CXL ☆64 · Dec 26, 2023 · Updated 2 years ago
- An RDMA-powered, fast, and scalable Paxos protocol ☆26 · Jun 15, 2019 · Updated 7 years ago
- Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on … ☆16 · Sep 18, 2025 · Updated 9 months ago
- ☆21 · Jan 2, 2023 · Updated 3 years ago
- Quartz: A DRAM-based performance emulator for NVM ☆162 · Jul 28, 2019 · Updated 6 years ago
- ☆70 · May 1, 2017 · Updated 9 years ago
- A scalable RAID system to aggregate performance and capacity of the next-generation storage. ☆15 · Jan 3, 2024 · Updated 2 years ago
- Gengar, a distributed shared hybrid memory pool with RDMA support. Gengar allows applications to access remote DRAM/NVM in a large and gl… ☆24 · May 24, 2022 · Updated 4 years ago
- SmartNIC ☆14 · Dec 13, 2018 · Updated 7 years ago
- A Program-Behavior-Guided Far Memory System ☆36 · Oct 26, 2023 · Updated 2 years ago
- Sample code from thegeekinthecorner.com ☆280 · Sep 13, 2020 · Updated 5 years ago
- Simulator of a memory controller to connect DRAMSim and FlashDIMMSim into one unified memory ☆17 · Apr 4, 2024 · Updated 2 years ago
- OFI Programmer's Guide ☆52 · Dec 29, 2022 · Updated 3 years ago
- THC: Accelerating Distributed Deep Learning Using Tensor Homomorphic Compression ☆19 · Jul 30, 2024 · Updated last year
- ddl-benchmarks: Benchmarks for Distributed Deep Learning ☆36 · May 29, 2020 · Updated 6 years ago
- CoRM: Compactable Remote Memory over RDMA ☆20 · Jun 18, 2021 · Updated 5 years ago
- ☆25 · Aug 1, 2016 · Updated 9 years ago
- MQ-ECN NS2 Simulation ☆11 · Feb 26, 2016 · Updated 10 years ago
- Mallacc: Accelerating Memory Allocation ☆13 · Jan 2, 2018 · Updated 8 years ago
- Primo: Practical Learning-Augmented Systems with Interpretable Models ☆19 · Dec 26, 2023 · Updated 2 years ago