microsoft/nnscaler

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/microsoft/nnscaler)

microsoft / nnscaler

nnScaler: Compiling DNN models for Parallel Training

☆135

Alternatives and similar repositories for nnscaler

Users that are interested in nnscaler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

microsoft / SuperScaler
View on GitHub
An experimental parallel training platform
☆57Mar 25, 2024Updated 2 years ago
awslabs / optimizing-multitask-training-through-dynamic-pipelines
View on GitHub
Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines
☆19Dec 8, 2023Updated 2 years ago
kungfu-team / tenplex
View on GitHub
Dynamic resources changes for multi-dimensional parallelism training
☆31Aug 22, 2025Updated 10 months ago
microsoft / TrainVerify
View on GitHub
A verification tool for ensuring parallelization equivalence in distributed model training.
☆17Sep 1, 2025Updated 10 months ago
jiazhihao / attention_superoptimizer
View on GitHub
An Attention Superoptimizer
☆22Jan 20, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
parasailteam / coconet
View on GitHub
☆85Dec 2, 2022Updated 3 years ago
tile-ai / tilescale
View on GitHub
Tile-based language built for AI computation across all scales
☆173Jun 16, 2026Updated last month
microsoft / ParrotServe
View on GitHub
[OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable
☆222Sep 21, 2024Updated last year
tile-ai / AttentionEngine
View on GitHub
☆52May 19, 2025Updated last year
zhuohan123 / terapipe
View on GitHub
☆79May 4, 2021Updated 5 years ago
volcengine / veScale
View on GitHub
Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs
☆1,031Mar 3, 2026Updated 4 months ago
awslabs / Lancet-Accelerating-MoE-Training-via-Whole-Graph-Computation-Communication-Overlapping
View on GitHub
Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…
☆14May 20, 2026Updated 2 months ago
sail-sg / zero-bubble-pipeline-parallelism
View on GitHub
Zero Bubble Pipeline Parallelism
☆462May 7, 2025Updated last year
DachengLi1 / AMP
View on GitHub
(NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters.
☆44Nov 4, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
microsoft / TileFusion
View on GitHub
TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.
☆115Jun 28, 2025Updated last year
HPDL-Group / Merak
View on GitHub
☆86Feb 11, 2026Updated 5 months ago
JF-D / Parcae
View on GitHub
☆22Apr 22, 2024Updated 2 years ago
Raphael-Hao / brainstorm
View on GitHub
Compiler for Dynamic Neural Networks
☆45Nov 13, 2023Updated 2 years ago
LoongServe / LoongServe
View on GitHub
☆135Nov 11, 2024Updated last year
microsoft / Tutel
View on GitHub
Tutel MoE: Optimized Mixture-of-Experts Library, Support GptOss/DeepSeek/Kimi-K2/Qwen3 using FP8/NVFP4/MXFP4
☆996Jul 8, 2026Updated last week
microsoft / SparTA
View on GitHub
☆167Jul 22, 2024Updated last year
zhisbug / Cavs
View on GitHub
Cavs: An Efficient Runtime System for Dynamic Neural Networks
☆15Sep 18, 2020Updated 5 years ago
saareliad / FTPipe
View on GitHub
FTPipe and related pipeline model parallelism research.
☆44May 16, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
UMass-LIDS / Proteus
View on GitHub
Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling
☆13Mar 7, 2024Updated 2 years ago
ByteDance-Seed / StragglerAnalysis
View on GitHub
☆56Apr 30, 2025Updated last year
microsoft / FractalTensor
View on GitHub
FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …
☆32Dec 21, 2024Updated last year
AlibabaPAI / DAPPLE
View on GitHub
An Efficient Pipelined Data Parallel Approach for Training Large Model
☆76Dec 11, 2020Updated 5 years ago
lhb8125 / Megatron-LM
View on GitHub
Ongoing research training transformer models at scale
☆19Jul 9, 2026Updated last week
tile-ai / TileFoundry
View on GitHub
☆54Updated this week
awslabs / slapo
View on GitHub
A schedule language for large model training
☆153Aug 21, 2025Updated 11 months ago
ByteDance-Seed / Triton-distributed
View on GitHub
Distributed Compiler based on Triton for Parallel Systems
☆1,494Updated this week
ParCIS / Chimera
View on GitHub
Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.
☆72Mar 20, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
NASP-THU / multiverse
View on GitHub
GPU-accelerated LLM Training Simulator
☆52Jun 26, 2025Updated last year
stepfun-ai / StepMesh
View on GitHub
☆377Jan 28, 2026Updated 5 months ago
alibaba / hap
View on GitHub
☆16Apr 13, 2024Updated 2 years ago
heheda12345 / MagPy
View on GitHub
☆41Jun 5, 2024Updated 2 years ago
microsoft / TileIR
View on GitHub
☆31Feb 28, 2025Updated last year
TiledTensor / TiledCUDA
View on GitHub
We invite you to visit and follow our new repository at https://github.com/microsoft/TileFusion. TiledCUDA is a highly efficient kernel …
☆192Jan 28, 2025Updated last year
microsoft / AttentionEngine
View on GitHub
☆123May 19, 2025Updated last year