awslabs / slapoLinks

A schedule language for large model training

☆151

Alternatives and similar repositories for slapo

Users that are interested in slapo are comparing it to the libraries listed below

Sorting:

awslabs / raf
☆145Updated 10 months ago
awslabs / lorien
☆42Updated 2 years ago
uwsampl / SparseTIR
SparseTIR: Sparse Tensor Compiler for Deep Learning
☆141Updated 2 years ago
zhuohan123 / terapipe
☆77Updated 4 years ago
tlc-pack / tenset
☆92Updated 3 years ago
parasailteam / coconet
☆83Updated 3 years ago
facebookexperimental / triton
Github mirror of trition-lang/triton repo.
☆100Updated this week
DachengLi1 / AMP
(NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters.
☆43Updated 3 years ago
thu-pacman / PET
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections
☆121Updated 3 years ago
UofT-EcoSystem / DietCode
DietCode Code Release
☆64Updated 3 years ago
microsoft / SparTA
☆159Updated last year
cmu-catalyst / collage
System for automated integration of deep learning backends.
☆47Updated 3 years ago
nox-410 / tvm.tl
An extention of TVMScript to write simple and high performance GPU kernels with tensorcore.
☆51Updated last year
awslabs / ratex
☆23Updated 3 months ago
parasj / checkmate
Training neural networks in TensorFlow 2.0 with 5x less memory
☆137Updated 3 years ago
yifuwang / symm-mem-recipes
☆148Updated 11 months ago
spcl / substation
Research and development for optimizing transformers
☆131Updated 4 years ago
ParCIS / Chimera
Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.
☆68Updated 8 months ago
cchan / tccl
extensible collectives library in triton
☆91Updated 8 months ago
UDC-GAC / venom
A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores
☆55Updated 2 years ago
saareliad / FTPipe
FTPipe and related pipeline model parallelism research.
☆43Updated 2 years ago
AlibabaResearch / flash-llm
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity
☆224Updated 2 years ago
microsoft / nnscaler
nnScaler: Compiling DNN models for Parallel Training
☆120Updated 2 months ago
stanford-futuredata / stk
☆113Updated last year
amirgholami / ai_and_memory_wall
AI and Memory Wall
☆222Updated last year
alibaba / easydist
Automated Parallelization System and Infrastructure for Multiple Ecosystems
☆80Updated last year
ConnollyLeon / awesome-Auto-Parallelism
A baseline repository of Auto-Parallelism in Training Neural Networks
☆147Updated 3 years ago
apuaaChen / EVT_AE
Artifacts of EVT ASPLOS'24
☆28Updated last year
triton-lang / kernels
☆94Updated last year
octoml / octoml-profile
Home for OctoML PyTorch Profiler
☆114Updated 2 years ago