EMDC-OS / power-aware-triton

Know Your Enemy To Save Cloud Energy: Energy-Performance Characterization of Machine Learning Serving (HPCA '23)

☆13

Alternatives and similar repositories for power-aware-triton

Users who are interested in power-aware-triton are comparing it to the libraries listed below.

Sys-KU / DeepPlan
[ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access
☆57Updated this week
arcs-skku / EMDC_llvm
Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.
☆38Updated last month
mlsys-seo / ooo-backprop
☆25Updated 2 years ago
VIA-Research / vTrain
☆73Updated 2 months ago
EMDC-OS / mg-lru
☆9Updated last month
casys-kaist / glet
☆49Updated 7 months ago
microsoft / taccl
TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches
☆74Updated 2 years ago
unist-ssl / IIDP
☆12Updated 4 months ago
boringlee24 / socc22-miso
MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant GPU Clusters
☆20Updated 2 years ago
casys-kaist / LLMServingSim
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale
☆127Updated 3 weeks ago
eth-easl / orion
An interference-aware scheduler for fine-grained GPU sharing
☆143Updated 6 months ago
mlcommons / chakra
Repository for MLCommons Chakra schema and tools
☆114Updated last week
casys-kaist / EnvPipe
☆25Updated last year
UMass-LIDS / Proteus
Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling
☆13Updated last year
microsoft / msccl-tools
Synthesizer for optimal collective communication algorithms
☆113Updated last year
mutinifni / splitwise-sim
LLM serving cluster simulator
☆108Updated last year
s3yonsei / blocked_samples
☆29Updated 5 months ago
microsoft / NPKit
NCCL Profiling Kit
☆140Updated last year
Raphael-Hao / Abacus
☆37Updated last month
uclasystem / bamboo
Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.
☆50Updated 2 years ago
microsoft / msccl
Microsoft Collective Communication Library
☆353Updated last year
skypilot-org / spot-traces
Releasing the spot availability traces used in "Can't Be Late" paper.
☆22Updated last year
HuaizhengZhang / MIGProfiler
Multi-Instance-GPU profiling tool
☆60Updated 2 years ago
SJTU-IPADS / reef
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…
☆98Updated 2 years ago
sitar-lab / NeuSight
☆48Updated last month
mcrl / tccl
Thunder Research Group's Collective Communication Library
☆39Updated last month
gajagajago / deepshare
Network Contention-Aware Cluster Scheduling with Reinforcement Learning (IEEE ICPADS'23)
☆16Updated last month
calculon-ai / calculon
☆145Updated last year
HPMLL / BurstGPT
A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
☆191Updated 2 weeks ago
msr-fiddle / DS-Analyzer
☆38Updated 4 years ago