asplos-contest / 2025Links

The ASPLOS 2025 / EuroSys 2025 Contest Track

☆37

Alternatives and similar repositories for 2025

Users that are interested in 2025 are comparing it to the libraries listed below

Sorting:

Hsword / Awesome-Machine-Learning-System-Papers
☆77Updated 3 years ago
alibaba-edu / qwen-bailian-usagetraces-anon
☆55Updated 4 months ago
google / iopddl
Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning
☆23Updated 5 months ago
parasailteam / coconet
☆83Updated 2 years ago
SJTU-IPADS / reef
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…
☆101Updated 2 years ago
NEO-MLSys25 / NEO
NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading
☆67Updated 4 months ago
Raphael-Hao / brainstorm
Compiler for Dynamic Neural Networks
☆46Updated last year
LLMServe / SwiftTransformer
High performance Transformer implementation in C++.
☆138Updated 9 months ago
yuyangJin / PerFlow-AI
PerFlow-AI is a programmable performance analysis, modeling, prediction tool for AI system.
☆24Updated 2 weeks ago
HPMLL / BurstGPT
A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
☆215Updated 3 months ago
open-neutrino / neutrino
☆194Updated 2 months ago
apache / tvm-ffi
Open ABI and FFI for Machine Learning Systems
☆138Updated this week
infinigence / FlashOverlap
A lightweight design for computation-communication overlap.
☆181Updated 2 weeks ago
HPMLL / NVIDIA-Hopper-Benchmark
☆61Updated 4 months ago
microsoft / nnscaler
nnScaler: Compiling DNN models for Parallel Training
☆117Updated last month
sitar-lab / NeuSight
☆53Updated 4 months ago
SJTU-IPADS / ugache
☆23Updated last year
KuangjuX / NVSHMEM-Tutorial
NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer
☆139Updated last month
humuyan / Korch
ASPLOS'24: Optimal Kernel Orchestration for Tensor Programs with Korch
☆37Updated 7 months ago
eth-easl / orion
An interference-aware scheduler for fine-grained GPU sharing
☆150Updated 9 months ago
ColfaxResearch / layout-categories
This repository contains companion software for the Colfax Research paper "Categorical Foundations for CuTe Layouts".
☆69Updated last month
zhaiyi000 / tlm
☆44Updated last year
AlibabaResearch / mononn
☆31Updated last year
wu-kan / GoPTX
GoPTX: Fine-grained GPU Kernel Fusion by PTX-level Instruction Flow Weaving
☆18Updated 2 months ago
thustorage / Medusa
Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]
☆33Updated 5 months ago
alibaba / llm-scheduling-artifact
Artifact of OSDI '24 paper, ”Llumnix: Dynamic Scheduling for Large Language Model Serving“
☆62Updated last year
mutinifni / splitwise-sim
LLM serving cluster simulator
☆116Updated last year
pku-liang / MAGIS
MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)
☆55Updated last year
ParCIS / Chimera
Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.
☆67Updated 7 months ago
Azure / msccl
Microsoft Collective Communication Library
☆66Updated 11 months ago