Raphael-Hao / brainstormLinks

Compiler for Dynamic Neural Networks

☆46

Alternatives and similar repositories for brainstorm

Users that are interested in brainstorm are comparing it to the libraries listed below

Sorting:

parasailteam / coconet
☆83Updated 2 years ago
alpa-projects / mms
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)
☆88Updated 2 years ago
yuyangJin / PerFlow-AI
PerFlow-AI is a programmable performance analysis, modeling, prediction tool for AI system.
☆24Updated 2 weeks ago
SJTU-IPADS / ugache
☆23Updated last year
SJTU-IPADS / reef
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…
☆101Updated 2 years ago
HPMLL / BurstGPT
A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
☆215Updated 3 months ago
mutinifni / splitwise-sim
LLM serving cluster simulator
☆116Updated last year
pkusys / ElasticFlow
Artifacts for our ASPLOS'23 paper ElasticFlow
☆54Updated last year
Raphael-Hao / Abacus
☆38Updated 4 months ago
alibaba / llm-scheduling-artifact
Artifact of OSDI '24 paper, ”Llumnix: Dynamic Scheduling for Large Language Model Serving“
☆62Updated last year
uclasystem / bamboo
Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.
☆53Updated 2 years ago
microsoft / SuperScaler
An experimental parallel training platform
☆54Updated last year
Thesys-lab / Helix-ASPLOS25
Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"
☆68Updated last week
google / iopddl
Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning
☆23Updated 5 months ago
alibaba-edu / qwen-bailian-usagetraces-anon
☆55Updated 4 months ago
SJTU-IPADS / disb
DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.
☆54Updated last year
pku-liang / MAGIS
MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)
☆55Updated last year
AlibabaResearch / recom
An Optimizing Compiler for Recommendation Model Inference
☆26Updated 4 months ago
EfficientLLMSys / MuxServe
☆13Updated last year
SJTU-IPADS / reef-artifacts
A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.
☆43Updated 3 years ago
LLMServe / dLoRA-artifact
☆27Updated last year
SymbioticLab / Oobleck
A resilient distributed training framework
☆96Updated last year
zhaiyi000 / tlm
☆44Updated last year
LoongServe / LoongServe
☆124Updated 11 months ago
AlibabaResearch / mononn
☆31Updated last year
eniac / paella
Paella: Low-latency Model Serving with Virtualized GPU Scheduling
☆62Updated last year
zhuohan123 / terapipe
☆75Updated 4 years ago
JF-D / Proteus
☆23Updated last year
awslabs / optimizing-multitask-training-through-dynamic-pipelines
Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines
☆20Updated last year
zhaiyi000 / tlp
☆41Updated last year