msr-fiddle/blox

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/msr-fiddle/blox)

msr-fiddle / blox

☆44

Alternatives and similar repositories for blox

Users that are interested in blox are comparing it to the libraries listed below

Sorting:

S-Lab-System-Group / Lucid
View on GitHub
Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs
☆58May 21, 2023Updated 2 years ago
msr-fiddle / synergy
View on GitHub
☆52Dec 13, 2022Updated 3 years ago
S-Lab-System-Group / Awesome-DL-Scheduling-Papers
View on GitHub
☆323Jan 22, 2024Updated 2 years ago
liu445126256 / FuncPipe
View on GitHub
☆11Jul 9, 2023Updated 2 years ago
shuoshuc / FabricEval
View on GitHub
An evaluation framework for data center traffic engineering.
☆14Jul 28, 2024Updated last year
siasosp23 / artifacts
View on GitHub
☆24Aug 15, 2023Updated 2 years ago
Rivendile / Muri
View on GitHub
Artifacts for our SIGCOMM'22 paper Muri
☆43Dec 29, 2023Updated 2 years ago
InternLM / AcmeTrace
View on GitHub
☆175Mar 12, 2024Updated last year
S-Lab-System-Group / ChronusArtifact
View on GitHub
☆23Jan 7, 2022Updated 4 years ago
S-Lab-System-Group / Hydro
View on GitHub
Surrogate-based Hyperparameter Tuning System
☆28Jun 29, 2023Updated 2 years ago
alibaba / alibaba-lingjun-dataset-2023
View on GitHub
☆64Jun 25, 2024Updated last year
casys-kaist / EnvPipe
View on GitHub
☆26Aug 31, 2023Updated 2 years ago
michaelzhiluo / starburst
View on GitHub
Burstable Cloud Scheduler
☆16Jun 6, 2024Updated last year
gudiandian / ElasticFlow
View on GitHub
☆17May 10, 2024Updated last year
stanford-futuredata / gavel
View on GitHub
Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020
☆137Jul 25, 2024Updated last year
netiken / m4
View on GitHub
[TBD] "m4: A Learned Flow-level Network Simulator" by Chenning Li, Anton A. Zabreyko, Om Chabra, Arash Nasr-Esfahany, Kevin Zhao, Pratees…
☆16Nov 18, 2025Updated 3 months ago
suquark / hoplite
View on GitHub
☆44Sep 6, 2021Updated 4 years ago
jasperzhong / swift
View on GitHub
☆15Apr 20, 2022Updated 3 years ago
uw-mad-dash / shockwave
View on GitHub
Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]
☆47Nov 24, 2022Updated 3 years ago
SymbioticLab / Oobleck
View on GitHub
A resilient distributed training framework
☆97Apr 11, 2024Updated last year
pengyanghua / DL2
View on GitHub
a deep learning-driven scheduler for elastic training in deep learning clusters
☆31Jan 14, 2021Updated 5 years ago
bytedance / QSync
View on GitHub
Official resporitory for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".
☆20Feb 23, 2024Updated 2 years ago
Thesys-lab / Helix-ASPLOS25
View on GitHub
Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"
☆77Oct 15, 2025Updated 4 months ago
pkusys / TGS
View on GitHub
Artifacts for our NSDI'23 paper TGS
☆96Jun 10, 2024Updated last year
romilbhardwaj / cilantro
View on GitHub
Source code for OSDI 2023 paper titled "Cilantro - Performance-Aware Resource Allocation for General Objectives via Online Feedback"
☆40Jul 6, 2023Updated 2 years ago
netiken / m3
View on GitHub
[ACM SIGCOMM 2024] "m3: Accurate Flow-Level Performance Estimation using Machine Learning" by Chenning Li, Arash Nasr-Esfahany, Kevin Zha…
☆25Oct 2, 2024Updated last year
S-Lab-System-Group / HeliosArtifact
View on GitHub
HeliosArtifact
☆22Sep 27, 2022Updated 3 years ago
xpan413 / FSMoE
View on GitHub
☆16Jan 14, 2025Updated last year
CSU-NetLab / A2TP-Eurosys2023
View on GitHub
☆11Mar 13, 2023Updated 2 years ago
microsoft / TE-CCL
View on GitHub
☆49Aug 27, 2024Updated last year
All-less / faas-scheduling-benchmark
View on GitHub
A benchmark suite for evaluating FaaS scheduler.
☆23Nov 5, 2022Updated 3 years ago
pengyanghua / optimus
View on GitHub
A Deep Learning Cluster Scheduler
☆37Jan 11, 2021Updated 5 years ago
vineeths96 / Gradient-Compression
View on GitHub
We present a set of all-reduce compatible gradient compression algorithms which significantly reduce the communication overhead while mai…
☆10Nov 14, 2021Updated 4 years ago
hiddenlayer2020 / ML-Job-Scheduler-MLFS
View on GitHub
☆11Dec 18, 2020Updated 5 years ago
microsoft / taccl
View on GitHub
TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches
☆80Jul 25, 2023Updated 2 years ago
kungfu-team / tenplex
View on GitHub
Dynamic resources changes for multi-dimensional parallelism training
☆30Aug 22, 2025Updated 6 months ago
petuum / adaptdl
View on GitHub
Resource-adaptive cluster scheduler for deep learning training.
☆454Mar 5, 2023Updated 3 years ago
CentML / lorafusion
View on GitHub
LoRAFusion: Efficient LoRA Fine-Tuning for LLMs
☆24Sep 23, 2025Updated 5 months ago
unist-ssl / IIDP
View on GitHub
☆13Apr 7, 2025Updated 10 months ago