cirquit / hivemind-multi-cloudLinks

☆9

Alternatives and similar repositories for hivemind-multi-cloud

Users that are interested in hivemind-multi-cloud are comparing it to the libraries listed below

Sorting:

dywsjtu / apparate
Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]
☆25Updated 7 months ago
SymbioticLab / ModelKeeper
A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup
☆35Updated 2 years ago
casys-kaist / EnvPipe
☆25Updated last year
Hsword / SpotServe
SpotServe: Serving Generative Large Language Models on Preemptible Instances
☆123Updated last year
SymbioticLab / Oobleck
A resilient distributed training framework
☆95Updated last year
uw-mad-dash / shockwave
Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]
☆44Updated 2 years ago
llm-db / FineInfer
Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024)
☆17Updated last year
James-QiuHaoran / LLM-serving-with-proxy-models
Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction | A tiny BERT model can tell you the verbosity of an …
☆36Updated last year
S-Lab-System-Group / Hydro
Surrogate-based Hyperparameter Tuning System
☆28Updated 2 years ago
suquark / hoplite
☆45Updated 3 years ago
msr-fiddle / blox
☆44Updated last year
kungfu-team / tenplex
Dynamic resources changes for multi-dimensional parallelism training
☆26Updated 8 months ago
michaelzhiluo / starburst
Burstable Cloud Scheduler
☆14Updated last year
msr-fiddle / CheckFreq
☆54Updated 4 years ago
uclasystem / bamboo
Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.
☆50Updated 2 years ago
WukLab / preble
Stateful LLM Serving
☆76Updated 4 months ago
zhengzangw / Sequence-Scheduling
PyTorch implementation of paper "Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline".
☆89Updated 2 years ago
zhuangwang93 / Espresso
Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2…
☆15Updated last year
WukLab / InferCept
☆28Updated last year
SymbioticLab / Fluid
A Generic Resource-Aware Hyperparameter Tuning Execution Engine
☆15Updated 3 years ago
alpa-projects / mms
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)
☆82Updated 2 years ago
eth-easl / cachew
ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).
☆39Updated 10 months ago
Ying1123 / VTC-artifact
☆32Updated last year
msr-fiddle / DS-Analyzer
☆38Updated 4 years ago
Rivendile / Muri
Artifacts for our SIGCOMM'22 paper Muri
☆42Updated last year
S-Lab-System-Group / ChronusArtifact
☆22Updated 3 years ago
zhuangwang93 / Cupcake
Cupcake: A Compression Scheduler for Scalable Communication-Efficient Distributed Training (MLSys '23)
☆9Updated 2 years ago
jasperzhong / swift
☆14Updated 3 years ago
mutinifni / splitwise-sim
LLM serving cluster simulator
☆107Updated last year
hao-ai-lab / MuxServe
☆64Updated last year