TensorFlow is a computational library that uses data flow graphs for scalable machine learning. Tensorflow-RDMA is an implementation of it over RDMA, which achieves about a 4.5x speedup on two nodes compared with TCP/IP.
☆59 Nov 27, 2022 Updated 3 years ago
Alternatives and similar repositories for Tensorflow-RDMA
Users who are interested in Tensorflow-RDMA are comparing it to the libraries listed below.
- LITE Kernel RDMA Support for Datacenter Applications. SOSP 2017. ☆111 Jul 9, 2020 Updated 5 years ago
- An RDMA-enabled Distributed Persistent Memory File System ☆162 Oct 14, 2017 Updated 8 years ago
- Frog is Asynchronous Graph Processing on GPU with Hybrid Coloring Model. The fundamental idea is based on Pareto principle (or 80-20 rule… ☆36 May 29, 2021 Updated 4 years ago
- [Archived] A Fast Multi-tiered Distributed Storage System based on User-Level I/O ☆74 Mar 2, 2018 Updated 8 years ago
- Falcon: Fast OLTP Engine for Persistent Cache and Non-Volatile Memory ☆11 Nov 1, 2023 Updated 2 years ago
- Source code for our OSDI 2016 paper ☆110 Nov 11, 2018 Updated 7 years ago
- [FAST 2022] FORD: Fast One-sided RDMA-based Distributed Transactions for Disaggregated Persistent Memory ☆62 Jun 22, 2024 Updated last year
- A tutorial on RDMA based programming using code examples ☆614 Jan 3, 2020 Updated 6 years ago
- ☆21 Nov 29, 2022 Updated 3 years ago
- Enterprise: Breadth-First Graph Traversal on GPUs. SC'15. ☆33 May 20, 2017 Updated 8 years ago
- This is the Group-Meeting collections of HKUST System NetworkING (SING) Research Group. ☆27 Oct 3, 2019 Updated 6 years ago
- This is the implementation repository of our SOSP'24 paper: Aceso: Achieving Efficient Fault Tolerance in Memory-Disaggregated Key-Value … ☆24 Oct 20, 2024 Updated last year
- A P4 implementation of a 5G UPF for BMv2 ☆15 Oct 11, 2021 Updated 4 years ago
- A framework to understand RDMA ☆410 Oct 12, 2023 Updated 2 years ago
- GPU-accelerated LLM Training Simulator ☆18 Jun 26, 2025 Updated 10 months ago
- Mirror of Apache crail (Incubating) ☆151 Jul 3, 2022 Updated 3 years ago
- ☆33 Mar 31, 2021 Updated 5 years ago
- Rcmp: Reconstructing RDMA-based Memory Disaggregation via CXL ☆64 Dec 26, 2023 Updated 2 years ago
- An RDMA-powered, fast, and scalable Paxos protocol ☆26 Jun 15, 2019 Updated 6 years ago
- Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on … ☆16 Sep 18, 2025 Updated 7 months ago
- ☆21 Jan 2, 2023 Updated 3 years ago
- Quartz: A DRAM-based performance emulator for NVM ☆162 Jul 28, 2019 Updated 6 years ago
- RDMA core userspace libraries and daemons ☆2,208 Apr 20, 2026 Updated last week
- ☆68 May 1, 2017 Updated 8 years ago
- A scalable RAID system to aggregate performance and capacity of the next-generation storage. ☆15 Jan 3, 2024 Updated 2 years ago
- A Write-friendly and Cache-optimized Hashing Scheme for Non-volatile Memory Systems (MSST 2017, TPDS 2018) ☆30 Apr 11, 2018 Updated 8 years ago
- Gengar, a distributed shared hybrid memory pool with RDMA support. Gengar allows applications to access remote DRAM/NVM in a large and gl… ☆24 May 24, 2022 Updated 3 years ago
- SmartNIC ☆14 Dec 13, 2018 Updated 7 years ago
- A Program-Behavior-Guided Far Memory System ☆36 Oct 26, 2023 Updated 2 years ago
- Sample code from thegeekinthecorner.com ☆280 Sep 13, 2020 Updated 5 years ago
- Simulator of a memory controller to connect DRAMSim and FlashDIMMSim into one unified memory ☆17 Apr 4, 2024 Updated 2 years ago
- OFI Programmer's Guide ☆52 Dec 29, 2022 Updated 3 years ago
- THC: Accelerating Distributed Deep Learning Using Tensor Homomorphic Compression ☆19 Jul 30, 2024 Updated last year
- ddl-benchmarks: Benchmarks for Distributed Deep Learning ☆36 May 29, 2020 Updated 5 years ago
- CoRM: Compactable Remote Memory over RDMA ☆20 Jun 18, 2021 Updated 4 years ago
- MQ-ECN NS2 Simulation ☆11 Feb 26, 2016 Updated 10 years ago
- Mallacc: Accelerating Memory Allocation ☆13 Jan 2, 2018 Updated 8 years ago
- Primo: Practical Learning-Augmented Systems with Interpretable Models ☆19 Dec 26, 2023 Updated 2 years ago
- Manycore platform Simulation tool for NoC-based platform at a Cycle-accurate level ☆13 Feb 22, 2018 Updated 8 years ago