[IJCAI2023] An automated parallel training system that combines the advantages from both data and model parallelism. If you have any interests, please visit/star/fork https://github.com/Youhe-Jiang/OptimalShardedDataParallel
☆52May 31, 2023Updated 2 years ago
Alternatives and similar repositories for IJCAI2023-OptimalShardedDataParallel
Users that are interested in IJCAI2023-OptimalShardedDataParallel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DELTA-pytorch:DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation☆12Apr 16, 2024Updated 2 years ago
- ☆20Oct 31, 2022Updated 3 years ago
- ☆24Jul 7, 2024Updated last year
- ☆85Dec 2, 2022Updated 3 years ago
- High performance NCCL plugin for Bagua.☆15Sep 15, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs). If you hav…☆24Oct 22, 2025Updated 6 months ago
- ☆14Jan 12, 2022Updated 4 years ago
- Bagua tutorials.☆13Sep 4, 2022Updated 3 years ago
- ☆78May 4, 2021Updated 5 years ago
- A high-performance distributed deep learning system targeting large-scale and automated distributed training.☆335Dec 13, 2025Updated 4 months ago
- A schedule language for large model training☆152Aug 21, 2025Updated 8 months ago
- Zero Bubble Pipeline Parallelism☆452May 7, 2025Updated last year
- A high-performance distributed deep learning system targeting large-scale and automated distributed training. If you have any interests, …☆124Dec 18, 2023Updated 2 years ago
- An implementation of parameter server framework in PyTorch RPC.☆11Nov 12, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Distributed IO-aware Attention algorithm☆24Sep 24, 2025Updated 7 months ago
- ☆84Feb 11, 2026Updated 2 months ago
- Sequence-level 1F1B schedule for LLMs.☆19Jun 4, 2024Updated last year
- (NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters.☆44Nov 4, 2022Updated 3 years ago
- Core communication lib for Bagua.☆48Sep 15, 2021Updated 4 years ago
- ☆23Aug 20, 2025Updated 8 months ago
- Yinghan's Code Sample