Thesys-lab / parity-modelsLinks
Learning-Based Coded Computation
☆47Updated 2 years ago
Alternatives and similar repositories for parity-models
Users that are interested in parity-models are comparing it to the libraries listed below
Sorting:
- ☆44Updated 3 years ago
- The prototype for NSDI paper "NetHint: White-Box Networking for Multi-Tenant Data Centers"☆26Updated last year
- Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]☆44Updated 2 years ago
- Code for reproducing experiments performed for Accoridon☆13Updated 4 years ago
- Managed collective communication service☆22Updated 9 months ago
- Justitia provides RDMA isolation between applications with diverse requirements.☆40Updated 3 years ago
- A Federated Execution Engine for Fast Distributed Computation Over Slow Networks☆26Updated 4 years ago
- ☆69Updated 2 years ago
- Aequitas enables RPC-level QoS in datacenter networks.☆16Updated 2 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆35Updated 2 years ago
- This repository describes I/O traces of Google storage servers and disks synthesized by Thesios. Thesios synthesizes representative I/O t…☆24Updated last year
- Cupcake: A Compression Scheduler for Scalable Communication-Efficient Distributed Training (MLSys '23)☆9Updated last year
- ☆20Updated 7 years ago
- Analyze network performance in distributed training☆18Updated 4 years ago
- ☆54Updated 4 years ago
- MIND: In-Network Memory Management for Disaggregated Data Centers☆42Updated 3 years ago
- Phoenix dataplane system service☆55Updated last year
- ☆38Updated 10 months ago
- Random collections of my interested research papers / projects☆20Updated 4 years ago
- Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.☆50Updated 2 years ago
- Benchmark Suite for RDMA Performance Isolation☆39Updated last year
- ☆14Updated 3 years ago
- ☆19Updated 2 years ago
- [ACM SIGCOMM 2024] "m3: Accurate Flow-Level Performance Estimation using Machine Learning" by Chenning Li, Arash Nasr-Esfahany, Kevin Zha…☆24Updated 8 months ago
- Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2…☆15Updated last year
- ☆7Updated 8 years ago
- ☆24Updated 2 years ago
- Virtual Memory Abstraction for Serverless Architectures☆48Updated 3 years ago
- NetLock: Fast, Centralized Lock Management Using Programmable Switches☆32Updated 4 years ago
- ☆50Updated 2 years ago