PKU-DAIR / Hetu-Galvatron
Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).
☆77 · Updated this week
Alternatives and similar repositories for Hetu-Galvatron
Users who are interested in Hetu-Galvatron are comparing it to the libraries listed below:
- [ICML 2024] JSQ: Compressing Large Language Models by Joint Sparsification and Quantization ☆147 · Updated 2 months ago
- MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts ☆161 · Updated 3 months ago
- Tool-Planner: Dynamic Solution Tree Planning for Large Language Model with Tool Clustering ☆102 · Updated 6 months ago
- ☆106 · Updated 2 years ago
- [NeurIPS'24] "Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration" ☆163 · Updated 2 weeks ago
- PyTorch implementation of "Sinkhorn Distance Minimization for Knowledge Distillation", COLING 2024 and TNNLS 2024 ☆119 · Updated last month
- Alchemy Cat: 🔥 Config System for SOTA ☆116 · Updated last month
- Synthora is a lightweight and extensible framework for LLM-driven Agents and ALM research. It provides essential components to build, tes… ☆66 · Updated this week
- 2024 MCM/ICM Problem E Outstanding Winner INFORMS Prize paper and source code ☆46 · Updated 5 months ago
- Adaptive Draft-Verification for Efficient Large Language Model Decoding ☆60 · Updated last month
- MoH: Multi-Head Attention as Mixture-of-Head Attention ☆188 · Updated 2 months ago
- ☆69 · Updated 3 weeks ago
- ☆333 · Updated last year
- EasyDeploy is engineered to provide users with end-to-end deployment capabilities for large-scale models. ☆63 · Updated last week
- ☆166 · Updated 2 months ago
- [ECML-PKDD'24] Hyperbolic Contrastive Learning with Model-Augmentation for Knowledge-Aware Recommendation ☆32 · Updated last month
- A collection of research papers on graph. ☆27 · Updated 3 weeks ago
- A Takeout System Admin Frontend built with Vue2 ☆70 · Updated 2 months ago
- PyTorch implementation of the paper: Exploring Regional Clues in CLIP for Zero-Shot Semantic Segmentation. ☆42 · Updated 6 months ago
- A quick-start guide to Kotlin for Java developers. ☆167 · Updated 5 months ago
- My Notes and Course projects ☆27 · Updated 4 months ago
- ☆116 · Updated 7 months ago
- Burned area segmentation ☆71 · Updated 2 weeks ago
- A lightweight Zhixue (智学网) bot that interfaces with the OneBot11 standard over HTTP ☆50 · Updated this week
- AI-assisted generation of attractive timelines that flexibly present time-space information. A framework that allows for the flexible construction of time-space information displays. ☆43 · Updated last month
- Deployment of different NN algorithms on the WTM2101; this tutorial covers the entire process of deploying the NN algorithm on the… ☆81 · Updated 3 months ago
- A quick tool and function collection designed specifically for projects using the JavaScript language ☆15 · Updated this week
- A web API service framework modeled as a simplified kube-apiserver (a wrapper around go-restful) ☆113 · Updated 5 months ago
- Convert SQL to Go structs, with support for gorm and xorm ☆65 · Updated last month