yichuan-w / MLsys_reading_listLinks
A record of reading list on some MLsys popular topic
☆17Updated 9 months ago
Alternatives and similar repositories for MLsys_reading_list
Users that are interested in MLsys_reading_list are comparing it to the libraries listed below
Sorting:
- Accelerating Large-Scale Reasoning Model Inference with Sparse Self-Speculative Decoding☆62Updated 2 weeks ago
- [NeurIPS 2025] ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive☆50Updated last week
- [NeurIPS'25 Spotlight] Adaptive Attention Sparsity with Hierarchical Top-p Pruning☆75Updated 3 weeks ago
- InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)☆167Updated last year
- Systems for GenAI☆148Updated 8 months ago
- [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank☆66Updated last year
- Preview Code for Continuum Paper☆18Updated last week
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆21Updated 4 months ago
- ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction (NIPS'24)☆49Updated last year
- ☆125Updated last year
- ☆79Updated 3 years ago
- ☆54Updated 3 months ago
- ☆79Updated 2 months ago
- ☆15Updated last year
- MS108 Course Project, SJTU ACM Class.