mosharaf / eecs598

Advanced Topics on Systems for X

☆277

Alternatives and similar repositories for eecs598

Users interested in eecs598 are comparing it to the repositories listed below.

fanlai0990 / CS598
Systems for GenAI
☆142 Updated 3 months ago
Hsword / Awesome-Machine-Learning-System-Papers
☆74 Updated 3 years ago
lambda7xx / awesome-AI-system
paper and its code for AI System
☆318 Updated 3 months ago
alpa-projects / mms
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)
☆83 Updated 2 years ago
netx-repo / PipeSwitch
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆127 Updated 3 years ago
HPMLL / BurstGPT
A ChatGPT (GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
☆190 Updated last week
Shenggan / awesome-distributed-ml
A curated list of awesome projects and papers for distributed training or inference
☆239 Updated 9 months ago
mental2008 / awesome-papers
Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and o…
☆115 Updated this week
mosharaf / cse585
Advanced Scalable Systems for X
☆37 Updated 8 months ago
SymbioticLab / Oobleck
A resilient distributed training framework
☆95 Updated last year
msr-fiddle / CheckFreq
☆55 Updated 4 years ago
Hsword / SpotServe
SpotServe: Serving Generative Large Language Models on Preemptible Instances
☆125 Updated last year
alibaba / llm-scheduling-artifact
Artifact of OSDI '24 paper, "Llumnix: Dynamic Scheduling for Large Language Model Serving"
☆62 Updated last year
galeselee / Awesome_LLM_System-PaperList
Since the emergence of ChatGPT in 2022, the acceleration of Large Language Models has become increasingly important. Here is a list of pap…
☆264 Updated 4 months ago
DicardoX / Research-Space
This repository is established to store personal notes and annotated papers during daily research.
☆138 Updated this week
alibaba / ServeGen
A framework for generating realistic LLM serving workloads
☆51 Updated last month
thustorage / Medusa
Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]
☆28 Updated 2 months ago
SJTU-IPADS / reef
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…
☆96 Updated 2 years ago
alibaba-edu / qwen-bailian-usagetraces-anon
☆39 Updated last month
uclasystem / bamboo
Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.
☆50 Updated 2 years ago
mutinifni / splitwise-sim
LLM serving cluster simulator
☆108 Updated last year
microsoft / ParrotServe
[OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable
☆170 Updated 10 months ago
uw-mad-dash / shockwave
Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]
☆44 Updated 2 years ago
WukLab / preble
Stateful LLM Serving
☆77 Updated 4 months ago
jasperzhong / read-papers-and-code
My paper/code reading notes in Chinese
☆46 Updated last month
byungsoo-oh / ml-systems-papers
Curated collection of papers in machine learning systems
☆388 Updated last month
stanford-mast / INFaaS
Model-less Inference Serving
☆90 Updated last year
pentium3 / sys_reading
system paper reading notes
☆246 Updated 3 years ago
SJTU-IPADS / disb
DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.
☆53 Updated 11 months ago
hongzhangblaze / CS854-F24
☆42 Updated this week