ruipeterpan / cs759-sp21
CS/ECE/ME/EP 759 (High Performance Computing for Engineering Applications) Course Project: Cautiously Aggressive GPU Space Sharing to Improve Resource Utilization and Job Efficiency
☆8Updated 3 years ago
Related projects: ⓘ
- Code for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]☆38Updated last year
- SOTA Learning-augmented Systems☆32Updated 2 years ago
- ☆48Updated 3 years ago
- ☆19Updated last year
- ☆31Updated 3 months ago
- Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion☆32Updated 4 months ago
- A language for video analytics☆13Updated last year
- ☆18Updated 2 years ago
- ☆37Updated 2 years ago
- Personal blog + reading notes on system-ish papers☆15Updated 10 months ago
- Thinking is hard - automate it☆18Updated 2 years ago
- Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.☆46Updated last year
- ☆21Updated 5 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆31Updated last year
- ☆22Updated 2 years ago
- Stateful LLM Serving☆25Updated last month
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.☆39Updated 2 years ago
- ☆45Updated last year
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling☆8Updated 6 months ago
- PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications☆124Updated 2 years ago
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale☆32Updated last month
- Analyze network performance in distributed training☆16Updated 3 years ago
- Artifacts for our ASPLOS'23 paper ElasticFlow☆51Updated 4 months ago
- SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training☆28Updated last year
- ☆29Updated 6 months ago
- Artifacts for our SIGCOMM'22 paper Muri☆38Updated 8 months ago
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework forDNN-driven Autonomous Systems"☆19Updated 3 years ago
- ☆13Updated 2 years ago
- ☆24Updated 4 years ago
- ☆40Updated last year