pprp/ultrascale-playbook-zh

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pprp/ultrascale-playbook-zh)

pprp / ultrascale-playbook-zh

UltraScale Playbook 中文版

☆167

Alternatives and similar repositories for ultrascale-playbook-zh

Users that are interested in ultrascale-playbook-zh are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Victarry / PyTorch-Memory-Profiler
View on GitHub
☆47Sep 8, 2025Updated 10 months ago
huggingface / nanotron
View on GitHub
Minimalistic large language model 3D-parallelism training
☆2,766May 26, 2026Updated 2 months ago
huggingface / picotron
View on GitHub
Minimalistic 4D-parallelism distributed training framework for education purpose
☆2,258Aug 26, 2025Updated 11 months ago
antgroup / DeepXTrace
View on GitHub
DeepXTrace is a lightweight tool for precisely diagnosing slow ranks in DeepEP-based environments.
☆101Jan 16, 2026Updated 6 months ago
skindhu / How-To-Scale-Your-Model-CN
View on GitHub
《How to Scale Your Model》中文翻译项目 - 智能技术文档翻译工具。专为大语言模型扩展技术书籍设计，突破长文档翻译瓶颈，完美保留数学公式、代码块格式。采用占位符机制+分层翻译策略，基于Gemini API提供高质量翻译。Python+crawl4ai技…
☆188Aug 30, 2025Updated 10 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
sean-wade / Yolox_augment
View on GitHub
Add some features to yolox
☆24Jan 12, 2023Updated 3 years ago
Mohan-Sai / Temperature-prediction-using-Machine-Learning
View on GitHub
Predicting the temperature of your system based on factors such as RAM usage,CPU storage temperature,Memory Used and space consumed by th…
☆11Apr 23, 2019Updated 7 years ago
BBuf / tensorrt-llm-moe
View on GitHub
☆34Feb 3, 2025Updated last year
liangyuwang / Tiny-DeepSpeed
View on GitHub
Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library
☆53Aug 20, 2025Updated 11 months ago
zhaochenyang20 / Awesome-ML-SYS-Tutorial
View on GitHub
My learning notes for ML SYS.
☆6,772Updated this week
lewin4 / DNN-Partition
View on GitHub
DNN partition edge-cloud co-infer
☆11Jun 11, 2023Updated 3 years ago
SLDGroup / GradientFilter-CVPR23
View on GitHub
☆13Sep 25, 2023Updated 2 years ago
GeeeekExplorer / nano-vllm
View on GitHub
Nano vLLM
☆14,635Apr 26, 2026Updated 3 months ago
BBuf / how-to-optim-algorithm-in-cuda
View on GitHub
how to optimize some algorithm in cuda.
☆3,147Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Infrasys-AI / AIInfra
View on GitHub
AIInfra（AI 基础设施）指AI系统从底层芯片等硬件，到上层软件栈支持AI大模型训练和推理。
☆7,714Dec 22, 2025Updated 7 months ago
toyaix / triton-runner
View on GitHub
Multi-Level Triton Runner supporting Python, IR, PTX, AMDGCN, cubin and hasco.
☆98May 8, 2026Updated 2 months ago
Tencent / KsanaLLM
View on GitHub
☆545Jul 14, 2026Updated last week
AXERA-TECH / SAM-ONNX-AX650-CPP
View on GitHub
☆18Dec 7, 2023Updated 2 years ago
xlite-dev / LeetCUDA
View on GitHub
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
☆11,631Updated this week
serhatarslan-hub / HomaL4Protocol-ns-3
View on GitHub
NS3 implementation of Homa Transport Protocol
☆24Dec 14, 2025Updated 7 months ago
BBuf / how-to-learn-deep-learning-framework
View on GitHub
how to learn PyTorch and OneFlow
☆502May 20, 2026Updated 2 months ago
UCSB-NLP-Chang / SemanticSmooth
View on GitHub
Implementation of paper 'Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing'
☆24Jun 9, 2024Updated 2 years ago
bytedance / flux
View on GitHub
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
☆1,345Aug 28, 2025Updated 10 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
OpenMOSS / Thus-Spake-Long-Context-LLM
View on GitHub
a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation
☆62Mar 31, 2025Updated last year
liangyuwang / Tiny-Megatron
View on GitHub
Tiny-Megatron, a minimalistic re-implementation of the Megatron library
☆32Sep 1, 2025Updated 10 months ago
xlite-dev / Awesome-LLM-Inference
View on GitHub
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
☆5,416Jun 23, 2026Updated last month
wk-ff / GTC
View on GitHub
reimplement of "GTC: Guided Training of CTC Towards Efficient and Accurate Scene Text Recognition"
☆15Nov 10, 2020Updated 5 years ago
StigLidu / AdaExplore
View on GitHub
The official implementation for paper "AdaExplore: Failure-Driven Adaptation and Diversity-Preserving Search for Efficient Kernel Generat…
☆22Jul 12, 2026Updated 2 weeks ago
st01tyy / LightScale
View on GitHub
Lightweight and Scalable Post-training: The Ray-Free, Debug-Friendly Alignment Stack with Megatron-native simplicity.
☆54May 20, 2026Updated 2 months ago
flashinfer-ai / flashinfer
View on GitHub
FlashInfer: Kernel Library for LLM Serving
☆6,032Updated this week
AldenWangExis / CMU-LLM-Course
View on GitHub
☆18Mar 1, 2026Updated 4 months ago
NVIDIA-NeMo / Megatron-Bridge
View on GitHub
Training library for Megatron-based models with bidirectional Hugging Face conversion capability
☆829Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
gau-nernst / learn-cuda
View on GitHub
Learn CUDA with PyTorch
☆354Jun 1, 2026Updated last month
mlc-ai / pith-train
View on GitHub
Compact and Agent-Native MoE Training System
☆299Updated this week
kvcache-ai / Mooncake
View on GitHub
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
☆5,999Updated this week
arjundevraj / stragglar
View on GitHub
☆15Oct 2, 2025Updated 9 months ago
snap-stanford / crust
View on GitHub
[NeurIPS 2020] Coresets for Robust Training of Neural Networks against Noisy Labels
☆36May 2, 2021Updated 5 years ago
chengaopro / AZHP
View on GitHub
☆16Jun 12, 2024Updated 2 years ago
jichilen / SAR_quick
View on GitHub
☆19Jun 25, 2019Updated 7 years ago