msr-fiddle / CoorDLLinks

☆24

Alternatives and similar repositories for CoorDL

Users that are interested in CoorDL are comparing it to the libraries listed below

Sorting:

msr-fiddle / DS-Analyzer
☆38Updated 4 years ago
msr-fiddle / CheckFreq
☆56Updated 4 years ago
SymbioticLab / Salus
Fine-grained GPU sharing primitives
☆147Updated 5 months ago
casys-kaist / HUVM
☆25Updated 3 years ago
SymbioticLab / Tiresias
Tiresias is a GPU cluster manager for distributed deep learning training.
☆164Updated 5 years ago
msr-fiddle / synergy
☆52Updated 3 years ago
suquark / hoplite
☆44Updated 4 years ago
casys-kaist / glet
☆53Updated last year
pkusys / ElasticFlow
Artifacts for our ASPLOS'23 paper ElasticFlow
☆55Updated last year
uclasystem / bamboo
Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.
☆55Updated 3 years ago
netx-repo / PipeSwitch
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆127Updated 3 years ago
stanford-futuredata / gavel
Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020
☆136Updated last year
rkhan055 / SHADE
SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training
☆35Updated 2 years ago
stanford-mast / INFaaS
Model-less Inference Serving
☆92Updated 2 years ago
msr-fiddle / harmony
☆17Updated 3 years ago
msr-fiddle / philly-traces
☆197Updated 6 years ago
eth-easl / orion
An interference-aware scheduler for fine-grained GPU sharing
☆158Updated last month
eth-easl / cachew
ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).
☆40Updated last year
Sys-KU / DeepPlan
[ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access
☆56Updated 5 months ago
dsrhaslab / monarch
Accelerating Deep Learning Training Through Transparent Storage Tiering (CCGrid'22)
☆19Updated 3 years ago
uclasystem / dorylus
Dorylus: Affordable, Scalable, and Accurate GNN Training
☆76Updated 4 years ago
jasperzhong / swift
☆15Updated 3 years ago
S-Lab-System-Group / Awesome-ML-for-System
SOTA Learning-augmented Systems
☆37Updated 3 years ago
Azure / msccl
Microsoft Collective Communication Library
☆66Updated last year
csl-iisc / GPM-ASPLOS22
☆36Updated last year
SJTU-IPADS / disb
DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.
☆57Updated last year
uw-mad-dash / shockwave
Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]
☆46Updated 3 years ago
tbd-ai / tbd-suite
☆47Updated 3 years ago
S-Lab-System-Group / HeliosData
Helios Traces from SenseTime
☆61Updated 3 years ago
SJTU-IPADS / reef
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…
☆104Updated 3 years ago