DicardoX / Research-Space

This repository is established to store personal notes and annotated papers during daily research.

☆165

Alternatives and similar repositories for Research-Space

Users who are interested in Research-Space are comparing it to the repositories listed below.

HPMLL / BurstGPT
A ChatGPT (GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
☆220 Updated 4 months ago
mental2008 / awesome-papers
Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and o…
☆137 Updated last month
chenhongyu2048 / LLM-inference-optimization-paper
Summary of some awesome work for optimizing LLM inference
☆139 Updated this week
lambda7xx / awesome-AI-system
Papers and their code for AI systems
☆339 Updated 3 months ago
Hsword / Awesome-Machine-Learning-System-Papers
☆79 Updated 3 years ago
galeselee / Awesome_LLM_System-PaperList
Since the emergence of ChatGPT in 2022, the acceleration of Large Language Models has become increasingly important. Here is a list of pap…
☆282 Updated 8 months ago
snu-comparch / InfiniGen
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)
☆163 Updated last year
byungsoo-oh / ml-systems-papers
Curated collection of papers in machine learning systems
☆463 Updated 2 weeks ago
mutinifni / splitwise-sim
LLM serving cluster simulator
☆122 Updated last year
S-Lab-System-Group / Awesome-DL-Scheduling-Papers
☆313 Updated last year
eth-easl / orion
An interference-aware scheduler for fine-grained GPU sharing
☆153 Updated last week
alpa-projects / mms
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)
☆91 Updated 2 years ago
Thesys-lab / Helix-ASPLOS25
Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"
☆73 Updated last month
LoongServe / LoongServe
☆124 Updated last year
LLMServe / dLoRA-artifact
☆27 Updated last year
AI-Infra-Team / awesome-papers
Paper reading and discussion notes, covering AI frameworks, distributed systems, cluster management, etc.
☆43 Updated 3 weeks ago
LLMServe / SwiftTransformer
High performance Transformer implementation in C++.
☆142 Updated 10 months ago
alibaba / llm-scheduling-artifact
Artifact of the OSDI '24 paper “Llumnix: Dynamic Scheduling for Large Language Model Serving”
☆63 Updated last year
MoE-Inf / awesome-moe-inference
Curated collection of papers in MoE model inference
☆308 Updated last month
pkusys / ElasticFlow
Artifacts for our ASPLOS'23 paper ElasticFlow
☆55 Updated last year
Raphael-Hao / Abacus
☆38 Updated 5 months ago
Hsword / SpotServe
SpotServe: Serving Generative Large Language Models on Preemptible Instances
☆132 Updated last year
SJTU-IPADS / reef
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…
☆103 Updated 2 years ago
alibaba-edu / qwen-bailian-usagetraces-anon
☆61 Updated last month
JF-D / Parcae
☆21 Updated last year
James-QiuHaoran / LLM-serving-with-proxy-models
Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction | A tiny BERT model can tell you the verbosity of an …
☆49 Updated last year
Relaxed-System-Lab / HexGen
[ICML 2024] Serving LLMs on heterogeneous decentralized clusters.
☆31 Updated last year
ConnollyLeon / awesome-Auto-Parallelism
A baseline repository of Auto-Parallelism in Training Neural Networks
☆147 Updated 3 years ago
S-Lab-System-Group / Lucid
Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs
☆58 Updated 2 years ago
NetX-lab / Ayo
[ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo
☆53 Updated 3 months ago