AFDWang/Hetu-Galvatron

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AFDWang/Hetu-Galvatron)

AFDWang / Hetu-Galvatron

Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs). If you have any interests, please visit/star/fork https://github.com/PKU-DAIR/Hetu-Galvatron

☆25

Alternatives and similar repositories for Hetu-Galvatron

Users that are interested in Hetu-Galvatron are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

PKU-DAIR / Hetu
View on GitHub
A high-performance distributed deep learning system targeting large-scale and automated distributed training.
☆339Dec 13, 2025Updated 7 months ago
Hsword / Hetu
View on GitHub
A high-performance distributed deep learning system targeting large-scale and automated distributed training. If you have any interests, …
☆126Dec 18, 2023Updated 2 years ago
Relaxed-System-Lab / HexiScale
View on GitHub
Accommodating Large Language Model Training over Heterogeneous Environment.
☆32Mar 13, 2025Updated last year
wangchang327 / compiler-lab-test-driver
View on GitHub
编译原理课程实践中用于测试的代码
☆10Jun 9, 2021Updated 5 years ago
Relaxed-System-Lab / HexGen
View on GitHub
[ICML 2024] Serving LLMs on heterogeneous decentralized clusters.
☆37May 6, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
pritamqu / HALVA
View on GitHub
[ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination
☆21Jan 27, 2025Updated last year
PKU-DAIR / SGL
View on GitHub
A scalable graph learning toolkit for extremely large graph datasets. (WWW'22, 🏆 Best Student Paper Award)
☆158May 10, 2024Updated 2 years ago
Hsword / Awesome-Machine-Learning-System-Papers
View on GitHub
☆80Mar 7, 2022Updated 4 years ago
KejiaZhang-Robust / AI-Agent-papers
View on GitHub
Collection of recent works on AI Agents.
☆17Jun 5, 2025Updated last year
smackers / whoop
View on GitHub
automatic data race analysis for Linux device drivers
☆12Jul 27, 2016Updated 10 years ago
thomas-young-2013 / open-box
View on GitHub
Generalized and Efficient Blackbox Optimization System.
☆86Feb 21, 2023Updated 3 years ago
AICrossSim / PLENA_Simulator
View on GitHub
☆22Updated this week
ruz048 / AutoLoRA
View on GitHub
☆10Apr 16, 2024Updated 2 years ago
codecaution / EvoMoE
View on GitHub
☆21Oct 31, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
MingSun-Tse / Good-DA-in-KD
View on GitHub
[NeurIPS'22] What Makes a "Good" Data Augmentation in Knowledge Distillation -- A Statistical Perspective
☆37Dec 15, 2022Updated 3 years ago
gsampler9 / gSampler
View on GitHub
☆29Aug 14, 2024Updated last year
UBOdin / EttuBench
View on GitHub
A SQL Query Similarity Metric Benchmark
☆16Apr 22, 2018Updated 8 years ago
unist-ssl / IIDP
View on GitHub
☆13Apr 7, 2025Updated last year
pittisl / ElasticTrainer
View on GitHub
Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)
☆14Nov 1, 2023Updated 2 years ago
AkideLiu / MiniCache
View on GitHub
☆14Sep 7, 2024Updated last year
unist-ssl / JABAS
View on GitHub
"JABAS: Joint Adaptive Batching and Automatic Scaling for DNN Training on Heterogeneous GPUs" (EuroSys '25)
☆16Apr 7, 2025Updated last year
iLearn-Lab / AAAI26-H-GAR
View on GitHub
[AAAI 2026] H-GAR: A Hierarchical Interaction Framework via Goal-Driven Observation-Action Reffnement for Robotic Manipulation
☆32Nov 28, 2025Updated 8 months ago
cosdt / vllm-ascend
View on GitHub
See vLLM official support: https://github.com/vllm-project/vllm-ascend
☆11Feb 5, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
PKU-Baichuan-MLSystemLab / SysBench
View on GitHub
SysBench: Can Large Language Models Follow System Messages?
☆40Sep 4, 2024Updated last year
mingukkang / MNIST-Tensorflow-Code
View on GitHub
It contains Data Augmentaion, Strided convolution, Batch Normalization, Leaky Relu, Global Average pooling, L2 Regularization, learning …
☆12Jun 3, 2018Updated 8 years ago
wangchang327 / latex-templates
View on GitHub
日常事务 LaTeX 懒人包
☆36Sep 4, 2022Updated 3 years ago
leokhoa / Open-DocLLM
View on GitHub
☆16Apr 3, 2024Updated 2 years ago
awslabs / optimizing-multitask-training-through-dynamic-pipelines
View on GitHub
Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines
☆19Dec 8, 2023Updated 2 years ago
softsys4ai / readingroup
View on GitHub
☆11Aug 21, 2018Updated 7 years ago
BioX-NKU / scBackdoor
View on GitHub
Backdoor attacks in single-cell pretrained models
☆34Aug 17, 2025Updated 11 months ago
sail-sg / zero-bubble-pipeline-parallelism
View on GitHub
Zero Bubble Pipeline Parallelism
☆464May 7, 2025Updated last year
bytedance / QSync
View on GitHub
Official resporitory for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".
☆20Feb 23, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
lim142857 / Sparsifiner
View on GitHub
Demo code for CVPR2023 paper "Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers"
☆15Jul 4, 2023Updated 3 years ago
shengshu-ai / TurboServe
View on GitHub
TurboServe: Serving Streaming Video Generation Efficiently and Economically
☆38Jul 12, 2026Updated 2 weeks ago
MOSSVENC / WebChat2Api
View on GitHub
DeepSeek 网页 API 反代
☆19May 3, 2026Updated 2 months ago
listentm / CROWDSELECT
View on GitHub
We systematically studied the influencing factors when LLM generates benchmarks,By using our code, you can generate high-quality QA datas…
☆20May 20, 2025Updated last year
Triang-jyed-driung / i8muon
View on GitHub
Muon in Int8 Precision Made Possible
☆20Jun 18, 2026Updated last month
nanduan / nanduan.github.io
View on GitHub
☆10Jul 6, 2026Updated 3 weeks ago
awslabs / Lancet-Accelerating-MoE-Training-via-Whole-Graph-Computation-Communication-Overlapping
View on GitHub
Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…
☆14May 20, 2026Updated 2 months ago