AI-Infra-Team/awesome-papers

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AI-Infra-Team/awesome-papers)

AI-Infra-Team / awesome-papers

Paper reading and discussion notes, covering AI frameworks, distributed systems, cluster management, etc.

☆68

Alternatives and similar repositories for awesome-papers

Users that are interested in awesome-papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Tele-AI / TeleTron
View on GitHub
To pioneer training long-context multi-modal transformer models
☆74Aug 8, 2025Updated 11 months ago
FlexFusion / FlexFusion
View on GitHub
The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221
☆31Apr 22, 2025Updated last year
OpenSQZ / MegatronApp
View on GitHub
Toolchain built around the Megatron-LM for Distributed Training
☆96May 20, 2026Updated last month
TJU-NSL / awesome-papers
View on GitHub
☆37Updated this week
eth-easl / sailor
View on GitHub
AI model training on heterogeneous, geo-distributed resources
☆44Nov 24, 2025Updated 7 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
nex-agi / NexVenusCL
View on GitHub
Nex Venus Communication Library
☆75Nov 17, 2025Updated 7 months ago
InternLM / Awesome-LLM-Training-System
View on GitHub
☆54Aug 6, 2024Updated last year
PDZZXL / Awesome-LLM-Serving
View on GitHub
Large Language Model (LLM) Serving Paper and Resource List
☆29Apr 16, 2026Updated 2 months ago
CentML / Mist
View on GitHub
[EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization
☆24Apr 13, 2026Updated 2 months ago
LoongServe / LoongServe
View on GitHub
☆134Nov 11, 2024Updated last year
CGCL-codes / streambox
View on GitHub
☆18May 28, 2024Updated 2 years ago
OmniForcing / OmniForcing
View on GitHub
Official implementation of "OmniForcing: Unleashing Real-time Joint Audio-Visual Generation"[arXiv:2603.11647]. OmniForcing is the first …
☆165Jun 14, 2026Updated 3 weeks ago
LIUZhening111 / BiSIC
View on GitHub
Official Pytorch Implementation of Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model [ECCV'24]
☆22Dec 24, 2024Updated last year
Hanchenli / vllm-continuum
View on GitHub
Preview Code for Continuum Paper
☆86Jun 29, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zderaann / BlenderGLB2OBJ
View on GitHub
Blender script to convert glb files to obj with textures
☆11Jun 30, 2022Updated 4 years ago
TransferQueue / TransferQueue
View on GitHub
[Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…
☆16Jan 16, 2026Updated 5 months ago
IST-DASLab / HALO
View on GitHub
HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…
☆31Feb 17, 2025Updated last year
gofreelee / SpaceServe
View on GitHub
☆31Apr 8, 2026Updated 3 months ago
cds-ruc / sac
View on GitHub
SAC: A Co-Design Cache Algorithm for Emerging SMR-based High-Density Disks
☆13Jan 13, 2020Updated 6 years ago
PluralisResearch / AsyncPP
View on GitHub
Asynchronous pipeline parallel optimization
☆22Feb 2, 2026Updated 5 months ago
gudiandian / ElasticFlow
View on GitHub
☆17May 10, 2024Updated 2 years ago
GoodwillComputingLab / CLITE
View on GitHub
☆10Mar 14, 2020Updated 6 years ago
msr-fiddle / synergy
View on GitHub
☆54Dec 13, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
guanrenyang / Tiny-TPU
View on GitHub
☆10Dec 15, 2023Updated 2 years ago
jiangyinzuo / vimrc
View on GitHub
My personal vim/neovim configuration files, dotfiles, docs and other scripts.
☆15Jun 19, 2026Updated 2 weeks ago
DeepLink-org / DLSlime
View on GitHub
Composable and Embeddable Communication Runtime for Distributed AI Services
☆102Jun 5, 2026Updated last month
pkusys / ElasticFlow
View on GitHub
Artifacts for our ASPLOS'23 paper ElasticFlow
☆56May 10, 2024Updated 2 years ago
ByteDance-Seed / StragglerAnalysis
View on GitHub
☆55Apr 30, 2025Updated last year
HiEST / gpu-topo-aware
View on GitHub
GPU topology-aware scheduler
☆13Jul 7, 2017Updated 9 years ago
AmadeusChan / Awesome-LLM-System-Papers
View on GitHub
☆645Jan 14, 2026Updated 5 months ago
NetX-lab / Echo-slowdown
View on GitHub
Slowdown prediction module of Echo: Simulating Distributed Training at Scale
☆13May 17, 2025Updated last year
ardenma / implicit-gemm-tensor-core-convolution
View on GitHub
Simple example of how to write an Implicit GEMM Convolution in CUDA using the tensor core WMMA API and bindings for PyTorch.
☆19Jun 29, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
cds-ruc / CruiseDB
View on GitHub
An SLA-Oriented LSM-Tree Key-Value Store for High-end Cloud Data Service
☆17Jan 24, 2021Updated 5 years ago
byungsoo-oh / ml-systems-papers
View on GitHub
Curated collection of papers in machine learning systems
☆627Feb 7, 2026Updated 5 months ago
Chen-Binghao / PilotFish
View on GitHub
PilotFish harvests the free GPU cycles of cloud gaming with deep learning training
☆14Jul 2, 2022Updated 4 years ago
PASSIONLab / distributed_sddmm
View on GitHub
Distributed SDDMM Kernel
☆12Jul 8, 2022Updated 4 years ago
angohr / angohr
View on GitHub
Experiments with C++20, modules, packaging
☆15Oct 6, 2024Updated last year
argonne-lcf / LLM-Inference-Bench
View on GitHub
LLM-Inference-Bench
☆62Jul 18, 2025Updated 11 months ago
shanhaoli / pkuszdp
View on GitHub
LaTeX template for dissertation proposals in Peking University Shenzhen.
☆16Feb 23, 2022Updated 4 years ago