liangyuRain / ForestCollLinks

☆13

Alternatives and similar repositories for ForestColl

Users that are interested in ForestColl are comparing it to the libraries listed below

Sorting:

microsoft / taccl
TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches
☆77Updated 2 years ago
Thesys-lab / Helix-ASPLOS25
Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"
☆73Updated last month
axio-project / FuseLink
Efficient GPU communication over multiple NICs.
☆21Updated last week
thustorage / Medusa
Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]
☆39Updated 6 months ago
alibaba-edu / qwen-bailian-usagetraces-anon
☆61Updated last month
microsoft / TE-CCL
☆42Updated last year
suquark / hoplite
☆44Updated 4 years ago
JF-D / Proteus
☆23Updated last year
H-Huang / torch_collective_extension
A minimum demo for PyTorch distributed extension functionality for collectives.
☆14Updated last year
uclasystem / bamboo
Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.
☆53Updated 2 years ago
SJTU-IPADS / reef
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…
☆103Updated 2 years ago
NEO-MLSys25 / NEO
NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading
☆69Updated 5 months ago
alpa-projects / mms
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)
☆91Updated 2 years ago
SJTU-IPADS / reef-artifacts
A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.
☆43Updated 3 years ago
msr-fiddle / blox
☆44Updated last year
yuyangJin / PerFlow-AI
PerFlow-AI is a programmable performance analysis, modeling, prediction tool for AI system.
☆24Updated last week
mlcommons / chakra-old
Repository for MLCommons Chakra schema and tools
☆39Updated last year
eth-easl / sailor
AI model training on heterogeneous, geo-distributed resources
☆21Updated this week
dywsjtu / apparate
Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]
☆25Updated last year
Rivendile / Muri
Artifacts for our SIGCOMM'22 paper Muri
☆44Updated last year
alibaba / llm-scheduling-artifact
Artifact of OSDI '24 paper, ”Llumnix: Dynamic Scheduling for Large Language Model Serving“
☆63Updated last year
msr-fiddle / synergy
☆51Updated 2 years ago
phoenix-dataplane / mCCS
Managed collective communication service
☆22Updated last year
pkusys / ElasticFlow
Artifacts for our ASPLOS'23 paper ElasticFlow
☆55Updated last year
jasperzhong / swift
☆15Updated 3 years ago
LLMServe / dLoRA-artifact
☆27Updated last year
gudiandian / ElasticFlow
☆16Updated last year
UMass-LIDS / Proteus
Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling
☆12Updated last year
zhuangwang93 / Espresso
Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2…
☆15Updated 2 years ago
spcl / rFaaS
rFaaS: a high-performance FaaS platform with RDMA acceleration for low-latency invocations.
☆57Updated 4 months ago