mental2008/awesome-papers

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mental2008/awesome-papers)

mental2008 / awesome-papers

Here are my personal paper reading notes (including machine learning systems, AI infrastructure, and other interesting stuffs).

☆207

Alternatives and similar repositories for awesome-papers

Users that are interested in awesome-papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

microsoft / MetaOpt
View on GitHub
MetaOpt: Towards efficient heuristic design with quantifiable and confident performance
☆23Jan 20, 2026Updated 5 months ago
lambda7xx / awesome-AI-system
View on GitHub
paper and its code for AI System
☆373May 14, 2026Updated last month
msr-fiddle / blox
View on GitHub
☆46Jul 4, 2024Updated 2 years ago
NEO-MLSys25 / NEO
View on GitHub
NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading
☆99Jun 16, 2025Updated last year
cslab-ntua / artificial-matrix-generator
View on GitHub
An artificial matrix generator in C
☆13Feb 16, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
S-Lab-System-Group / Lucid
View on GitHub
Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs
☆60May 21, 2023Updated 3 years ago
AmberLJC / LLMSys-PaperList
View on GitHub
Large Language Model (LLM) Systems Paper List
☆2,167Jun 21, 2026Updated 2 weeks ago
LLMServe / dLoRA-artifact
View on GitHub
☆32May 28, 2024Updated 2 years ago
xpan413 / FSMoE
View on GitHub
☆16Jan 14, 2025Updated last year
michaelzhiluo / starburst
View on GitHub
Burstable Cloud Scheduler
☆17Jun 6, 2024Updated 2 years ago
terminal-agent / reptile
View on GitHub
💻 Terminal-Agent with Human-in-the-Loop Learning
☆40Jan 16, 2026Updated 5 months ago
Adaxry / Unified_Layer_Skipping
View on GitHub
☆15Apr 11, 2024Updated 2 years ago
microsoft / ParrotServe
View on GitHub
[OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable
☆221Sep 21, 2024Updated last year
msr-fiddle / philly-traces
View on GitHub
☆199Aug 31, 2019Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
hao-ai-lab / cse234-w25
View on GitHub
Website for CSE 234, Winter 2025
☆16Mar 24, 2025Updated last year
Azure / msccl
View on GitHub
Microsoft Collective Communication Library
☆66Nov 23, 2024Updated last year
CalvinXKY / mfu_calculation
View on GitHub
A simple calculation for LLM MFU.
☆78Sep 10, 2025Updated 9 months ago
microsoft / taccl
View on GitHub
TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches
☆82Jul 25, 2023Updated 2 years ago
clusterfarmem / cfm
View on GitHub
Cluster Far Mem, framework to execute single job and multi job experiments using fastswap
☆21Jan 12, 2024Updated 2 years ago
kooyunmo / cuda-uvm-gpt2
View on GitHub
PyTorch-UVM on super-large language models.
☆17Dec 21, 2020Updated 5 years ago
ByteDance-Seed / Triton-distributed
View on GitHub
Distributed Compiler based on Triton for Parallel Systems
☆1,476Jun 25, 2026Updated 2 weeks ago
SJTU-IPADS / PhoenixOS
View on GitHub
Fast OS-level support for GPU checkpoint and restore
☆285Sep 28, 2025Updated 9 months ago
EfficientLLMSys / MuxServe
View on GitHub
☆15Jun 26, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
microsoft / vattention
View on GitHub
Dynamic Memory Management for Serving LLMs without PagedAttention
☆500Jun 10, 2026Updated 3 weeks ago
S-Lab-System-Group / Awesome-DL-Scheduling-Papers
View on GitHub
☆332Jan 22, 2024Updated 2 years ago
alibaba / GPU-scheduler-for-deep-learning
View on GitHub
GPU-scheduler-for-deep-learning
☆213Nov 5, 2020Updated 5 years ago
microsoft / msccl
View on GitHub
Microsoft Collective Communication Library
☆393Sep 20, 2023Updated 2 years ago
NetX-lab / Echo
View on GitHub
Simulating Distributed Training at Scale
☆14Sep 15, 2025Updated 9 months ago
flashinfer-ai / flashinfer
View on GitHub
FlashInfer: Kernel Library for LLM Serving
☆5,896Updated this week
parasailteam / coconet
View on GitHub
☆85Dec 2, 2022Updated 3 years ago
DicardoX / Research-Space
View on GitHub
This repository is established to store personal notes and annotated papers during daily research.
☆199Jun 28, 2026Updated last week
spcl / muliticast-based-allgather
View on GitHub
☆24Feb 12, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
CSU-NetLab / A2TP-Eurosys2023
View on GitHub
☆12Mar 13, 2023Updated 3 years ago
Thesys-lab / Helix-ASPLOS25
View on GitHub
Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"
☆93Oct 15, 2025Updated 8 months ago
Ascend / TransferQueue
View on GitHub
An asynchronous streaming data management module for efficient post-training.
☆106Updated this week
clusterfarmem / clustersim
View on GitHub
Cluster simulator with far memory
☆12Apr 28, 2020Updated 6 years ago
eth-easl / orion
View on GitHub
An interference-aware scheduler for fine-grained GPU sharing
☆162Nov 26, 2025Updated 7 months ago
c3sr / tcu_scope
View on GitHub
☆50Jun 27, 2019Updated 7 years ago
mlcommons / chakra-old
View on GitHub
Repository for MLCommons Chakra schema and tools
☆38Dec 24, 2023Updated 2 years ago