EMDC-OS / mg-lru
☆9Updated 2 months ago
Alternatives and similar repositories for mg-lru:
Users that are interested in mg-lru are comparing it to the libraries listed below
- Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.☆38Updated 2 months ago
- Know Your Enemy To Save Cloud Energy: Energy-Performance Characterization of Machine Learning Serving (HPCA '23)☆13Updated 2 months ago
- ☆23Updated 3 years ago
- ☆14Updated 2 months ago
- ☆10Updated 5 months ago
- ☆24Updated last month
- ☆12Updated 6 months ago
- "JABAS: Joint Adaptive Batching and Automatic Scaling for DNN Training on Heterogeneous GPUs" (EuroSys '25)☆12Updated 2 months ago
- ☆47Updated 2 months ago
- Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23)☆57Updated 11 months ago
- ☆279Updated last year
- Curated collection of papers in machine learning systems☆264Updated 3 weeks ago
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale☆95Updated last month
- MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant GPU Clusters☆18Updated last year
- LLM serving cluster simulator☆94Updated 11 months ago
- ☆37Updated 3 years ago
- An interference-aware scheduler for fine-grained GPU sharing☆129Updated last month
- Helios Traces from SenseTime☆53Updated 2 years ago
- [ATC '24] Metis: Fast automatic distributed training on heterogeneous GPUs (https://www.usenix.org/conference/atc24/presentation/um)☆25Updated 4 months ago
- This repository is established to store personal notes and annotated papers during daily research.☆115Updated this week
- Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs☆53Updated last year
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling☆10Updated last year
- Artifacts for our NSDI'23 paper TGS☆76Updated 9 months ago
- MIST: High-performance IoT Stream Processing☆17Updated 6 years ago
- ☆186Updated 5 years ago
- Exploring the Design Space of Page Management for Multi-Tiered Memory Systems (USENIX ATC '21)☆45Updated 2 years ago
- ☆18Updated 9 months ago
- ☆40Updated 8 months ago
- Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020☆126Updated 8 months ago
- ☆49Updated 2 years ago