Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs). If you have any interests, please visit/star/fork https://github.com/PKU-DAIR/Hetu-Galvatron
☆23Oct 22, 2025Updated 4 months ago
Alternatives and similar repositories for Hetu-Galvatron
Users that are interested in Hetu-Galvatron are comparing it to the libraries listed below
Sorting:
- Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).☆177Jan 19, 2026Updated last month
- Accommodating Large Language Model Training over Heterogeneous Environment.☆25Mar 13, 2025Updated 11 months ago
- A high-performance distributed deep learning system targeting large-scale and automated distributed training.☆334Dec 13, 2025Updated 2 months ago
- A high-performance distributed deep learning system targeting large-scale and automated distributed training. If you have any interests, …☆124Dec 18, 2023Updated 2 years ago
- An efficient open-source AutoML system for automating machine learning lifecycle, including feature engineering, neural architecture sear…☆64Nov 11, 2025Updated 3 months ago
- [ICML 2024] Serving LLMs on heterogeneous decentralized clusters.☆34May 6, 2024Updated last year
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆11May 6, 2023Updated 2 years ago
- 北京大学 2024 秋季学期编译原理课程 Lab 代码、笔记、经验☆16Sep 12, 2025Updated 5 months ago
- ☆26Feb 28, 2025Updated last year
- Pytorch--使用伪标签训练efficientNet模型☆11Dec 28, 2019Updated 6 years ago
- This repository contains the official implementation of the paper entitled with "FedAPEN: Personalized Cross-silo Federated Learning with…☆14Dec 4, 2023Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆13Feb 11, 2026Updated 3 weeks ago
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…☆14Nov 17, 2025Updated 3 months ago
- ☆10Apr 16, 2024Updated last year
- A simple LaTeX template for CUHK thesis.☆13Apr 24, 2023Updated 2 years ago
- See vLLM official support: https://github.com/vllm-project/vllm-ascend☆11Feb 5, 2025Updated last year
- ☆13Apr 7, 2025Updated 11 months ago
- The code for paper 'Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and Monte Carlo Tr…☆21Aug 18, 2023Updated 2 years ago
- Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2…☆15Sep 21, 2023Updated 2 years ago
- Repository for OpenCL codes.☆11Jul 30, 2015Updated 10 years ago
- ☆11Sep 7, 2024Updated last year
- The source code of paper "Semantic Enhanced Text-to-SQL Parsing via Iteratively Learning Schema Linking Graph" in KDD2022.☆15Jan 9, 2023Updated 3 years ago
- My tests and experiments with some popular dl frameworks.☆17Sep 11, 2025Updated 5 months ago
- "JABAS: Joint Adaptive Batching and Automatic Scaling for DNN Training on Heterogeneous GPUs" (EuroSys '25)☆16Apr 7, 2025Updated 11 months ago
- Paper: "Aggregating Capacity in FL through Successive Layer Training for Computationally-Constrained Devices"☆18Jan 10, 2024Updated 2 years ago
- Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines☆19Dec 8, 2023Updated 2 years ago
- Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels☆48Feb 28, 2026Updated last week
- It contains Data Augmentaion, Strided convolution, Batch Normalization, Leaky Relu, Global Average pooling, L2 Regularization, learning …☆12Jun 3, 2018Updated 7 years ago
- PyTorch distributed training from scratch (for educational purposes only)☆21Apr 12, 2025Updated 10 months ago
- 数据结构与算法课的实验、作业代码,以及课堂ppt☆15Jan 10, 2019Updated 7 years ago
- AggNet: Deep Learning From Crowds for Mitosis Detection in Breast Cancer Histology Images☆14May 14, 2018Updated 7 years ago
- ☆20Mar 26, 2025Updated 11 months ago
- Mobile Federated Learning development kit for FedCampus☆19Feb 3, 2024Updated 2 years ago
- The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”☆17Feb 26, 2024Updated 2 years ago
- A SQL Query Similarity Metric Benchmark☆16Apr 22, 2018Updated 7 years ago
- Zero Bubble Pipeline Parallelism☆451May 7, 2025Updated 10 months ago
- An Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales☆16Jun 6, 2024Updated last year
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling☆12Mar 7, 2024Updated 2 years ago
- AlphaJoin: Join Order Selection à la AlphaG☆16Apr 22, 2020Updated 5 years ago