HFAiLab / hai-platform
A high-performance deep learning training platform with task-level time-sharing scheduling of GPU compute
☆491 Updated last year
Alternatives and similar repositories for hai-platform:
Users who are interested in hai-platform are comparing it to the libraries listed below
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications. ☆629 Updated last month
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training. ☆266 Updated last year
- FlagScale is a large model toolkit based on open-sourced projects. ☆223 Updated this week
- ☆314 Updated last month
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud. ☆851 Updated last week
- GLake: optimizing GPU memory management and IO transmission. ☆431 Updated 2 months ago
- ☆213 Updated last year
- A flexible and efficient training framework for large-scale alignment tasks ☆303 Updated last week
- A streamlined and customizable framework for efficient large model evaluation and performance benchmarking ☆442 Updated this week
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including … ☆231 Updated last week
- LLM Inference benchmark ☆392 Updated 6 months ago
- ☆152 Updated this week
- FlagPerf is an open-source software platform for benchmarking AI chips. ☆321 Updated 2 weeks ago
- HFAI deep learning models ☆131 Updated last year
- ☆120 Updated this week
- Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch ☆303 Updated this week
- DLRover: An Automatic Distributed Deep Learning System ☆1,340 Updated this week
- Best practice for training LLaMA models in Megatron-LM ☆644 Updated last year
- A Survey of AI startups ☆395 Updated last year
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training ☆398 Updated last month
- ☆299 Updated 8 months ago
- An integrated user interface for use with HAI Platform ☆35 Updated last year
- Disaggregated serving system for Large Language Models (LLMs). ☆466 Updated 6 months ago
- FireFlyer Record file format, writer and reader for DL training samples. ☆139 Updated 2 years ago
- optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052 ☆469 Updated 11 months ago
- ☆127 Updated last month
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs). ☆238 Updated 11 months ago
- ☆30 Updated 2 years ago
- ☆273 Updated last year