arcs-skku / EMDC_llvm
Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.
☆38 Updated 4 months ago
Alternatives and similar repositories for EMDC_llvm
Users who are interested in EMDC_llvm are comparing it to the repositories listed below
- ☆23 Updated 3 years ago
- Know Your Enemy To Save Cloud Energy: Energy-Performance Characterization of Machine Learning Serving (HPCA '23) ☆13 Updated 4 months ago
- ☆10 Updated 4 months ago
- ☆13 Updated 6 months ago
- ☆14 Updated 4 months ago
- ☆30 Updated last month
- Load generator and trace sampler for serverless computing ☆25 Updated last week
- MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant GPU Clusters ☆20 Updated 2 years ago
- Heterogeneous Memory Software Development Kit ☆84 Updated 10 months ago
- "JABAS: Joint Adaptive Batching and Automatic Scaling for DNN Training on Heterogeneous GPUs" (EuroSys '25) ☆16 Updated 6 months ago
- [ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access ☆57 Updated 2 months ago
- ☆53 Updated 10 months ago
- ☆196 Updated 2 months ago
- ☆312 Updated last year
- Artifacts for our NSDI'23 paper TGS ☆89 Updated last year
- 🚨 Prediction of the Resource Consumption of Distributed Deep Learning Systems ☆15 Updated 2 years ago
- [USENIX ATC 2021] Exploring the Design Space of Page Management for Multi-Tiered Memory Systems ☆47 Updated 3 years ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration ☆15 Updated 4 years ago
- ☆103 Updated 2 years ago
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale ☆144 Updated 3 months ago
- ☆40 Updated 2 years ago
- ☆195 Updated 6 years ago
- ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale ☆450 Updated last week
- ☆73 Updated 5 months ago
- ☆25 Updated 2 years ago
- Network Contention-Aware Cluster Scheduling with Reinforcement Learning (IEEE ICPADS'23) ☆17 Updated 3 months ago
- An interference-aware scheduler for fine-grained GPU sharing ☆150 Updated 9 months ago
- LaLaRAND: Flexible Layer-by-Layer CPU/GPU Scheduling for Real-Time DNN Tasks ☆15 Updated 3 years ago
- ☆20 Updated 9 months ago
- Intercepting CUDA runtime calls with LD_PRELOAD ☆42 Updated 11 years ago