SamsungLabs / FastFlowLinks

FastFlow is a system that automatically detects CPU bottlenecks in deep learning training pipelines and resolves the bottlenecks with data pipeline offloading to remote resources .

☆27

Alternatives and similar repositories for FastFlow

Users that are interested in FastFlow are comparing it to the libraries listed below

Sorting:

casys-kaist / EnvPipe
☆25Updated last year
Sys-KU / DeepPlan
[ACM EuroSys '23] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access
☆56Updated last year
rkhan055 / SHADE
SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training
☆33Updated 2 years ago
casys-kaist / glet
☆49Updated 5 months ago
UMass-LIDS / Proteus
Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling
☆12Updated last year
zhuangwang93 / Espresso
Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2…
☆15Updated last year
msr-fiddle / CheckFreq
☆53Updated 4 years ago
pkusys / ElasticFlow
Artifacts for our ASPLOS'23 paper ElasticFlow
☆51Updated last year
msr-fiddle / DS-Analyzer
☆36Updated 4 years ago
casys-kaist / HUVM
☆23Updated 2 years ago
msr-fiddle / CoorDL
☆24Updated last year
VIA-Research / vTrain
☆68Updated last week
casys-kaist / LLMServingSim
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale
☆117Updated this week
Thesys-lab / Helix-ASPLOS25
Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"
☆46Updated 6 months ago
msr-fiddle / synergy
☆50Updated 2 years ago
SymbioticLab / ModelKeeper
A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup
☆34Updated 2 years ago
SJTU-IPADS / disb
DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.
☆52Updated 9 months ago
SJTU-IPADS / ugache
☆22Updated last year
Raphael-Hao / Abacus
☆37Updated 3 years ago
ucare-uchicago / ev-store-dlrm
☆30Updated last year
ruipeterpan / marconi
Artifact for "Marconi: Prefix Caching for the Era of Hybrid LLMs" [MLSys '25 Outstanding Paper Honorable Mention]
☆10Updated 3 months ago
yuyangJin / PerFlow-AI
PerFlow-AI is a programmable performance analysis, modeling, prediction tool for AI system.
☆19Updated last month
msr-fiddle / harmony
☆16Updated 2 years ago
SamsungLabs / Metis
[ATC '24] Metis: Fast automatic distributed training on heterogeneous GPUs (https://www.usenix.org/conference/atc24/presentation/um)
☆26Updated 6 months ago
mutinifni / splitwise-sim
LLM serving cluster simulator
☆102Updated last year
S-Lab-System-Group / Awesome-ML-for-System
SOTA Learning-augmented Systems
☆36Updated 3 years ago
HuaizhengZhang / MIGProfiler
Multi-Instance-GPU profiling tool
☆58Updated 2 years ago
ranggihwang / Pregated_MoE
☆47Updated last year
WukLab / preble
Stateful LLM Serving
☆70Updated 2 months ago
microsoft / SuperScaler
An experimental parallel training platform
☆54Updated last year