Relaxed-System-Lab / COMP4901Y_Course_HKUST

Course Material for the UG Course COMP4901Y

☆52

Alternatives and similar repositories for COMP4901Y_Course_HKUST:

Users that are interested in COMP4901Y_Course_HKUST are comparing it to the libraries listed below

October2001 / Awesome-KV-Cache-Compression
📰 Must-read papers on KV Cache Compression (constantly updating 🤗).
☆276Updated 2 weeks ago
fanlai0990 / CS598
Systems for GenAI
☆85Updated this week
Relaxed-System-Lab / COMP6211J_Course_HKUST
☆39Updated last month
Zefan-Cai / Awesome-LLM-KV-Cache
Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.
☆197Updated last month
hao-ai-lab / vllm-ltr
[NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank
☆34Updated 2 months ago
hongzhangblaze / CS854-F24
☆34Updated 2 months ago
Hsword / Awesome-Machine-Learning-System-Papers
☆62Updated 2 years ago
microsoft / ParrotServe
[OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable
☆136Updated 4 months ago
TreeAI-Lab / Awesome-KV-Cache-Management
☆45Updated 3 weeks ago
YaoJiayi / CacheBlend
☆70Updated 3 weeks ago
HPMLL / BurstGPT
A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
☆142Updated 3 months ago
hemingkx / Spec-Bench
Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)
☆218Updated 3 months ago
galeselee / Awesome_LLM_System-PaperList
Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of pap…
☆213Updated last month
JF-D / Proteus
☆17Updated 6 months ago
thu-pacman / FasterMoE
☆72Updated 2 years ago
LoongServe / LoongServe
☆82Updated 2 months ago
lambda7xx / awesome-AI-system
paper and its code for AI System
☆262Updated this week
LiuXiaoxuanPKU / OSD
☆40Updated last month
PKU-DAIR / Starter-Guide
A comprehensive guide for beginners in the field of data management and artificial intelligence.
☆142Updated 2 months ago
zhengzangw / Sequence-Scheduling
PyTorch implementation of paper "Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline".
☆81Updated last year
MoE-Inf / awesome-moe-inference
Curated collection of papers in MoE model inference
☆41Updated last week
UChi-JCL / CacheGen
☆84Updated 3 months ago
mosharaf / cse585
Advanced Scalable Systems for X
☆29Updated last month
MLSys-Learner-Resources / Awesome-MLSys-Blogger
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
☆153Updated 3 weeks ago
openpsi-project / ReaLHF
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
☆205Updated 2 weeks ago
henryzhongsc / longctx_bench
Official implementation for Yuan & Liu & Zhong et al., KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark o…
☆61Updated 3 weeks ago
UbiquitousLearning / Efficient_Foundation_Model_Survey
Survey Paper List - Efficient LLM and Foundation Models
☆238Updated 4 months ago
Guangxuan-Xiao / GSM8K-eval
☆29Updated last year
d-matrix-ai / keyformer-llm
☆51Updated 10 months ago
yuandong-tian / arXiv_recbot
A Telegram bot to recommend arXiv papers
☆237Updated 3 weeks ago