SymbioticLab / Sol
A Federated Execution Engine for Fast Distributed Computation Over Slow Networks
☆26Updated 4 years ago
Alternatives and similar repositories for Sol:
Users that are interested in Sol are comparing it to the libraries listed below
- MeshInsight: Dissecting Overheads of Service Mesh Sidecars☆47Updated last year
- Justitia provides RDMA isolation between applications with diverse requirements.☆40Updated 2 years ago
- Aequitas enables RPC-level QoS in datacenter networks.☆16Updated 2 years ago
- The prototype for NSDI paper "NetHint: White-Box Networking for Multi-Tenant Data Centers"☆26Updated last year
- Virtual Memory Abstraction for Serverless Architectures☆48Updated 3 years ago
- Expressive, Easy to Build, and High-Performance Application Networks☆16Updated this week
- Hydra adds resilience and high availability to remote memory solutions.☆30Updated 3 years ago
- ☆7Updated 7 years ago
- Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]☆43Updated 2 years ago
- Phoenix dataplane system service☆54Updated 10 months ago
- NetLock: Fast, Centralized Lock Management Using Programmable Switches☆30Updated 4 years ago
- Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2…☆15Updated last year
- Nu is a new datacenter system that enables developers to build fungible applications that can use datacenter resources wherever they are.☆38Updated 11 months ago
- The code for both the framework and experiments from the NSDI '19 paper "Loom: Flexible and Efficient NIC Packet Scheduling"☆30Updated 6 years ago
- Managed collective communication service☆19Updated 8 months ago
- ☆44Updated 3 years ago
- MIND: In-Network Memory Management for Disaggregated Data Centers☆42Updated 3 years ago
- Tiresias is a GPU cluster manager for distributed deep learning training.☆153Updated 5 years ago
- RackSched: A Microsecond-Scale Scheduler for Rack-Scale Computers☆22Updated 4 years ago
- A Memory-Disaggregated Managed Runtime.☆66Updated 3 years ago
- ☆51Updated 10 months ago
- Reading seminar in Harvard Cloud Networking and Systems Group☆16Updated 2 years ago
- Sources and examples for ASPLOS20 paper☆14Updated 4 years ago
- ☆20Updated 7 years ago
- A collection of tools, code, and documentation to understand the host network on real server hardware.☆34Updated 5 months ago
- Cupcake: A Compression Scheduler for Scalable Communication-Efficient Distributed Training (MLSys '23)☆9Updated last year
- Analyze network performance in distributed training☆18Updated 4 years ago
- Deduplication over dis-aggregated memory for Serverless Computing☆12Updated 3 years ago
- ☆26Updated 7 months ago
- This repository contains a list of papers on various topics (that I am working/worked on) in the system and networking area.☆79Updated 4 months ago