alibaba / GPU-scheduler-for-deep-learning

GPU-scheduler-for-deep-learning

☆210

Alternatives and similar repositories for GPU-scheduler-for-deep-learning

Users who are interested in GPU-scheduler-for-deep-learning are comparing it to the libraries listed below.

SymbioticLab / Salus
Fine-grained GPU sharing primitives
☆146 Updated 3 months ago
SymbioticLab / Tiresias
Tiresias is a GPU cluster manager for distributed deep learning training.
☆163 Updated 5 years ago
stanford-futuredata / gavel
Code for "Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020
☆131 Updated last year
netx-repo / PipeSwitch
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆126 Updated 3 years ago
microsoft / hivedscheduler
Kubernetes Scheduler for Deep Learning
☆262 Updated 3 years ago
msr-fiddle / philly-traces
☆196 Updated 6 years ago
kubedl-io / morphling
Automatic tuning for ML model deployment on Kubernetes
☆81 Updated last year
kzhang28 / Optimus
An Efficient Dynamic Resource Scheduler for Deep Learning Clusters
☆42 Updated 8 years ago
pengyanghua / optimus
A Deep Learning Cluster Scheduler
☆39 Updated 4 years ago
pkusys / TGS
Artifacts for our NSDI'23 paper TGS
☆89 Updated last year
NTHU-LSALAB / Gemini
An efficient GPU resource sharing system with fine-grained control for Linux platforms.
☆85 Updated last year
google / nccl-fastsocket
NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
☆122 Updated last year
S-Lab-System-Group / HeliosData
Helios Traces from SenseTime
☆58 Updated 3 years ago
Mellanox / nccl-rdma-sharp-plugins
RDMA and SHARP plugins for the NCCL library
☆212 Updated 3 weeks ago
casys-kaist / glet
☆53 Updated 10 months ago
stanford-mast / INFaaS
Model-less Inference Serving
☆92 Updated 2 years ago
msr-fiddle / synergy
☆51 Updated 2 years ago
S-Lab-System-Group / Awesome-DL-Scheduling-Papers
☆312 Updated last year
eniac / paella
Paella: Low-latency Model Serving with Virtualized GPU Scheduling
☆62 Updated last year
Bruce-Lee-LY / cuda_hook
Hooked CUDA-related dynamic libraries by using automated code generation tools.
☆169 Updated last year
uwsampl / nexus
☆83 Updated 4 months ago
sakjain92 / Fractional-GPUs
Splits a single Nvidia GPU into multiple partitions with complete compute and memory isolation (with respect to performance) between the partitions
☆160 Updated 6 years ago
eth-easl / orion
An interference-aware scheduler for fine-grained GPU sharing
☆150 Updated 9 months ago
geoffxy / habitat
🔮 Execution time predictions for deep neural network training iterations across different GPUs.
☆62 Updated 2 years ago
petuum / adaptdl
Resource-adaptive cluster scheduler for deep learning training.
☆448 Updated 2 years ago
zw0610 / zw0610.github.io
☆58 Updated 5 years ago
cjg / GVirtuS
This repository is an archive. Refer to https://github.com/gvirtus/GVirtuS
☆45 Updated 3 years ago
Raphael-Hao / Abacus
☆38 Updated 4 months ago
msr-fiddle / blox
☆43 Updated last year
AlibabaPAI / DAPPLE
An Efficient Pipelined Data Parallel Approach for Training Large Models
☆76 Updated 4 years ago