SymbioticLab / Fluid
A Generic Resource-Aware Hyperparameter Tuning Execution Engine
☆15Updated 3 years ago
Alternatives and similar repositories for Fluid:
Users that are interested in Fluid are comparing it to the libraries listed below
- ☆35Updated 4 years ago
- ☆22Updated 5 years ago
- ☆23Updated last year
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆34Updated 2 years ago
- ☆14Updated 3 years ago
- ☆24Updated last year
- ☆43Updated 3 years ago
- Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.☆48Updated 2 years ago
- PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications☆127Updated 2 years ago
- ☆21Updated 2 years ago
- ☆47Updated 2 years ago
- Fine-grained GPU sharing primitives☆141Updated 4 years ago
- Model-less Inference Serving☆84Updated last year
- Artifacts for our ASPLOS'23 paper ElasticFlow☆53Updated 9 months ago
- Tiresias is a GPU cluster manager for distributed deep learning training.☆151Updated 4 years ago
- Helios Traces from SenseTime☆53Updated 2 years ago
- ☆44Updated last month
- ☆20Updated 3 years ago
- ☆37Updated 3 years ago
- Metis: Learning to Schedule Long-Running Applications in Shared Container Clusters with at Scale☆17Updated 4 years ago
- Machine learning on serverless platform☆8Updated 2 years ago
- Multi-Instance-GPU profiling tool☆56Updated last year
- SOTA Learning-augmented Systems☆34Updated 2 years ago
- ☆53Updated 4 years ago
- Analyze network performance in distributed training☆17Updated 4 years ago
- Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024)☆12Updated 8 months ago
- Code for reproducing experiments performed for Accoridon☆13Updated 3 years ago
- Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion☆32Updated 9 months ago
- ☆23Updated 2 years ago
- ☆14Updated 2 years ago