Shigangli / Chimera

Chimera: Efficiently Training Large-Scale Neural Networks with Bidirectional Pipelines.

☆46

Related projects ⓘ

Alternatives and complementary repositories for Chimera

parasailteam / coconet
☆73Updated last year
zhuohan123 / terapipe
☆65Updated 3 years ago
alpa-projects / mms
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)
☆78Updated last year
microsoft / nnscaler
nnScaler: Compiling DNN models for Parallel Training
☆74Updated 3 weeks ago
microsoft / SuperScaler
An experimental parallel training platform
☆52Updated 7 months ago
msr-fiddle / piper
☆9Updated 2 years ago
HPMLL / BurstGPT
A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
☆132Updated last month
Raphael-Hao / brainstorm
Compiler for Dynamic Neural Networks
☆43Updated last year
Shigangli / Magicube
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
☆81Updated last year
hao-ai-lab / MuxServe
☆46Updated 5 months ago
HPDL-Group / Merak
☆74Updated last month
alibaba / easydist
Automated Parallelization System and Infrastructure for Multiple Ecosystems
☆75Updated this week
Mutinifni / splitwise-sim
LLM serving cluster simulator
☆81Updated 6 months ago
YukeWang96 / MGG_OSDI23
Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…
☆37Updated 8 months ago
LoongServe / LoongServe
☆52Updated last week
SymbioticLab / Oobleck
A resilient distributed training framework
☆85Updated 7 months ago
microsoft / msccl-tools
Synthesizer for optimal collective communication algorithms
☆99Updated 7 months ago
LLMServe / dLoRA-artifact
☆14Updated 5 months ago
pku-liang / MAGIS
MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)
☆43Updated 5 months ago
YukeWang96 / QGTC_PPoPP22
Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.
☆27Updated 2 years ago
SJTU-IPADS / ugache
☆23Updated last year
AlibabaResearch / mononn
☆19Updated 4 months ago
pkusys / ElasticFlow
Artifacts for our ASPLOS'23 paper ElasticFlow
☆52Updated 6 months ago
alibaba / llm-scheduling-artifact
Artifact of OSDI '24 paper, ”Llumnix: Dynamic Scheduling for Large Language Model Serving“
☆57Updated 5 months ago
AlibabaPAI / DAPPLE
An Efficient Pipelined Data Parallel Approach for Training Large Model
☆70Updated 3 years ago
zhaiyi000 / tlm
☆30Updated 4 months ago
c3sr / tcu_scope
☆44Updated 5 years ago
casys-kaist / HUVM
☆23Updated 2 years ago
eniac / paella
Paella: Low-latency Model Serving with Virtualized GPU Scheduling
☆57Updated 6 months ago
Hsword / Awesome-Machine-Learning-System-Papers
☆56Updated 2 years ago