JF-D/Parcae

☆22

Alternatives and similar repositories for Parcae

Users who are interested in Parcae are comparing it to the repositories listed below.

uclasystem / bamboo
Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.
☆54 · Dec 11, 2022 · Updated 3 years ago
awslabs / optimizing-multitask-training-through-dynamic-pipelines
Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines
☆19 · Dec 8, 2023 · Updated 2 years ago
Hsword / SpotServe
SpotServe: Serving Generative Large Language Models on Preemptible Instances
☆135 · Feb 22, 2024 · Updated 2 years ago
SymbioticLab / Oobleck
A resilient distributed training framework
☆100 · Apr 11, 2024 · Updated 2 years ago
msr-fiddle / dejavu
☆22 · Aug 13, 2024 · Updated last year
bytedance / QSync
Official repository for "IPDPS '24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".
☆20 · Feb 23, 2024 · Updated 2 years ago
DataStates / datastates-llm
LLM checkpointing for DeepSpeed/Megatron
☆25 · Nov 30, 2025 · Updated 7 months ago
skypilot-org / spot-traces
Releasing the spot availability traces used in the "Can't Be Late" paper.
☆27 · Mar 31, 2024 · Updated 2 years ago
UMass-LIDS / Proteus
Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling
☆13 · Mar 7, 2024 · Updated 2 years ago
kungfu-team / tenplex
Dynamic resource changes for multi-dimensional parallelism training
☆31 · Aug 22, 2025 · Updated 10 months ago
casys-kaist / EnvPipe
☆27 · Aug 31, 2023 · Updated 2 years ago
Aleph-Alpha-Research / NeurIPS-WANT-submission-efficient-parallelization-layouts
☆22 · Dec 15, 2023 · Updated 2 years ago
siasosp23 / artifacts
☆24 · Aug 15, 2023 · Updated 2 years ago
microsoft / varuna
☆250 · Jul 25, 2024 · Updated last year
facebookresearch / DLRM-FlexFlow
Development repository for integrating FlexFlow (A distributed deep learning framework that supports flexible parallelization strategies)…
☆29 · Oct 12, 2021 · Updated 4 years ago
illinoisdata / ElasticNotebook
Enabling Live Migration for Computational Notebooks.
☆13 · Mar 11, 2024 · Updated 2 years ago
JF-D / Proteus
☆24 · Jul 7, 2024 · Updated 2 years ago
ByteDance-Seed / StragglerAnalysis
☆56 · Apr 30, 2025 · Updated last year
CCTV666 / UCAS-Master-Entrance-Guide
💯💯💯 A study guide and resource collection for the graduate entrance exam for the academic master's program in computer science at the University of Chinese Academy of Sciences (UCAS).
☆13 · Jul 28, 2019 · Updated 6 years ago
Scientific-Computing-Lab / MPI-rigen
MPI Code Generation through Domain-Specific Language Models
☆16 · Nov 19, 2024 · Updated last year
hao-ai-lab / vllm-ltr
[NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank
☆81 · Nov 4, 2024 · Updated last year
kazukiosawa / pipe-fisher
☆10 · Apr 29, 2023 · Updated 3 years ago
msr-fiddle / CheckFreq
☆57 · Jan 25, 2021 · Updated 5 years ago
microsoft / nnscaler
nnScaler: Compiling DNN models for Parallel Training
☆135 · Jul 2, 2026 · Updated 2 weeks ago
thustorage / GCR
Code repo for GCR [FAST'26]
☆16 · Mar 3, 2026 · Updated 4 months ago
Rivendile / Muri
Artifacts for our SIGCOMM'22 paper Muri
☆44 · Dec 29, 2023 · Updated 2 years ago
hyungyokim / LIA_AMXGPU
[ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading
☆13 · Jun 28, 2025 · Updated last year
zenrran4nlp / Awesome-LLM-Inference-Serving
☆50 · Apr 29, 2025 · Updated last year
Networked-System-and-Security-Group / Mnemosyne
APNet'25 - Mnemosyne: Lightweight and Fast Error Recovery for LLM Training in a Just-In-Time Manner
☆15 · Aug 3, 2025 · Updated 11 months ago
KuangjuX / AttnLink
An experimental communicating attention kernel based on DeepEP.
☆34 · Jul 29, 2025 · Updated 11 months ago
nicklashansen / adaptive-learning-rate-schedule
PyTorch implementation of the "Learning an Adaptive Learning Rate Schedule" paper found here: https://arxiv.org/abs/1909.09712.
☆12 · Jan 15, 2020 · Updated 6 years ago
einverne / AndroidFaceDetectDemo
Android face detection: android.media, play service, Face++
☆11 · Aug 13, 2016 · Updated 9 years ago
ParCIS / Chimera
Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.
☆72 · Mar 20, 2025 · Updated last year
Thesys-lab / Helix-ASPLOS25
Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"
☆93 · Oct 15, 2025 · Updated 9 months ago
DachengLi1 / AMP
(NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters.
☆44 · Nov 4, 2022 · Updated 3 years ago
nex-agi / NexVenusCL
Nex Venus Communication Library
☆75 · Nov 17, 2025 · Updated 8 months ago
unist-ssl / IIDP
☆13 · Apr 7, 2025 · Updated last year
tadglines / wfq
Weighted fair queue algorithm
☆24 · Oct 25, 2014 · Updated 11 years ago
unist-ssl / JABAS
"JABAS: Joint Adaptive Batching and Automatic Scaling for DNN Training on Heterogeneous GPUs" (EuroSys '25)
☆16 · Apr 7, 2025 · Updated last year