Oneflow-Inc / occl

☆11

Related projects

Alternatives and complementary repositories for occl

parasailteam / coconet
☆73 · Updated last year
SJTU-IPADS / reef
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…
☆85 · Updated last year
casys-kaist / HUVM
☆23 · Updated 2 years ago
eniac / paella
Paella: Low-latency Model Serving with Virtualized GPU Scheduling
☆57 · Updated 6 months ago
msr-fiddle / CheckFreq
☆51 · Updated 3 years ago
eth-easl / orion
An interference-aware scheduler for fine-grained GPU sharing
☆111 · Updated 6 months ago
casys-kaist / glet
☆41 · Updated last year
zhuohan123 / terapipe
☆66 · Updated 3 years ago
SJTU-IPADS / ugache
☆23 · Updated last year
S-Lab-System-Group / Awesome-ML-for-System
SOTA Learning-augmented Systems
☆33 · Updated 2 years ago
msr-fiddle / harmony
☆16 · Updated last year
SJTU-IPADS / disb
DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.
☆54 · Updated 3 months ago
Shigangli / Chimera
Chimera: Efficiently Training Large-Scale Neural Networks with Bidirectional Pipelines.
☆46 · Updated 11 months ago
platformxlab / G10
☆33 · Updated last year
c3sr / tcu_scope
☆44 · Updated 5 years ago
Sys-KU / DeepPlan
Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23)
☆54 · Updated 7 months ago
SJTU-IPADS / reef-artifacts
A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.
☆39 · Updated 2 years ago
alibaba / llm-scheduling-artifact
Artifact of OSDI '24 paper, "Llumnix: Dynamic Scheduling for Large Language Model Serving"
☆57 · Updated 5 months ago
rkhan055 / SHADE
SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training
☆29 · Updated last year
Hsword / Awesome-Machine-Learning-System-Papers
☆56 · Updated 2 years ago
casys-kaist / EnvPipe
☆23 · Updated last year
JF-D / Parcae
☆12 · Updated 6 months ago
msr-fiddle / CoorDL
☆23 · Updated last year
UMass-LIDS / Proteus
Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling
☆8 · Updated 8 months ago
microsoft / nnscaler
nnScaler: Compiling DNN models for Parallel Training
☆75 · Updated 3 weeks ago
microsoft / taccl
TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches
☆64 · Updated last year
AlibabaResearch / recom
An Optimizing Compiler for Recommendation Model Inference
☆22 · Updated 9 months ago
LLMServe / SwiftTransformer
High performance Transformer implementation in C++.
☆82 · Updated 2 months ago
UofT-EcoSystem / hotline
☆31 · Updated last year
msr-fiddle / DS-Analyzer
☆35 · Updated 3 years ago