aravic / generalizable-device-placement
Reference code for https://arxiv.org/abs/1906.08879
☆16Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for generalizable-device-placement
- ☆38Updated 4 years ago
- Metis: Learning to Schedule Long-Running Applications in Shared Container Clusters with at Scale☆17Updated 4 years ago
- HeliosArtifact☆18Updated 2 years ago
- https://arxiv.org/abs/1706.04972☆42Updated 5 years ago
- Surrogate-based Hyperparameter Tuning System☆27Updated last year
- This repository contains code for the paper: Bergsma S., Zeyl T., Senderovich A., and Beck J. C., "Generating Complex, Realistic Cloud Wo…☆42Updated 3 years ago
- ☆19Updated 2 years ago
- Code for "Solving Large-Scale Granular Resource Allocation Problems Efficiently with POP", which appeared at SOSP 2021☆24Updated 2 years ago
- Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024)☆11Updated 5 months ago
- ☆29Updated 4 months ago
- ☆22Updated 2 months ago
- ☆14Updated 2 years ago
- Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion☆32Updated 6 months ago
- Artifact for OSDI'21 GNNAdvisor: An Adaptive and Efficient Runtime System for GNN Acceleration on GPUs.☆63Updated last year
- ☆218Updated last year
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…☆60Updated 2 years ago
- A Generic Resource-Aware Hyperparameter Tuning Execution Engine☆15Updated 2 years ago
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…☆37Updated 8 months ago
- [ICLR 2022] "PipeGCN: Efficient Full-Graph Training of Graph Convolutional Networks with Pipelined Feature Communication" by Cheng Wan, Y…☆31Updated last year
- A Deep Learning Cluster Scheduler☆37Updated 3 years ago
- Artifacts for our SIGCOMM'22 paper Muri☆40Updated 10 months ago
- [NSDI 2023] TopoOpt: Optimizing the Network Topology for Distributed DNN Training☆26Updated 2 months ago
- ☆48Updated last year
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆34Updated last year
- RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads☆42Updated 3 years ago
- Deep reinforcement learning for REsource Allocation in streaM processing☆27Updated last year
- ☆22Updated 3 years ago
- Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.☆27Updated 2 years ago
- AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)☆78Updated last year
- Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs☆50Updated last year