HFAiLab / hai-platform

A high-performance deep learning training platform with task-level, time-shared scheduling of GPU compute

☆699

Alternatives and similar repositories for hai-platform

Users who are interested in hai-platform compare it to the libraries listed below.

Tencent / KsanaLLM
☆503 Updated last month
alibaba / rtp-llm
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
☆874 Updated last week
vllm-project / vllm-ascend
Community maintained hardware plugin for vLLM on Ascend
☆1,179 Updated last week
FlagOpen / FlagScale
FlagScale is a large model toolkit based on open-sourced projects.
☆358 Updated last week
thu-pacman / chitu
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
☆1,293 Updated this week
intelligent-machine-learning / dlrover
DLRover: An Automatic Distributed Deep Learning System
☆1,561 Updated last week
HFAiLab / hai-platform-studio
An integrated user interface for use with HAI Platform
☆53 Updated 2 years ago
modelscope / dash-infer
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …
☆265 Updated 2 months ago
antgroup / glake
GLake: optimizing GPU memory management and IO transmission.
☆479 Updated 6 months ago
volcengine / veGiantModel
☆219 Updated 2 years ago
mindspore-lab / mindformers
☆174 Updated this week
Ascend / pytorch
Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch
☆437 Updated 3 weeks ago
ninehills / llm-inference-benchmark
LLM inference benchmark
☆426 Updated last year
alibaba / EasyParallelLibrary
Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.
☆268 Updated 2 years ago
alibaba / Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
☆1,367 Updated this week
volcengine / ml-platform-sdk-python
☆32 Updated 2 years ago
PaddlePaddle / PaddleFlow
☆123 Updated 7 months ago
FlagOpen / FlagPerf
FlagPerf is an open-source software platform for benchmarking AI chips.
☆352 Updated 2 months ago
bytedance / flux
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
☆1,137 Updated last month
kvcache-ai / Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
☆4,054 Updated this week
modelscope / evalscope
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking.
☆1,762 Updated last week
volcengine / veScale
A PyTorch Native LLM Training Framework
☆872 Updated 3 weeks ago
alibaba / Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
☆659 Updated last year
LLMServe / DistServe
Disaggregated serving system for Large Language Models (LLMs).
☆700 Updated 6 months ago
4paradigm / k8s-vgpu-scheduler
OpenAIOS vGPU device plugin for Kubernetes is originated from the OpenAIOS project to virtualize GPU device memory, in order to allow app…
☆575 Updated last year
virtaitech / orion
☆277 Updated 2 years ago
DeepLink-org / dlinfer
☆63 Updated last month
antgroup / llm-oss-landscape
Open Source Landscapes and Insights Produced by AntOSS
☆216 Updated last week
alibaba / ChatLearn
A flexible and efficient training framework for large-scale alignment tasks
☆428 Updated this week
ForceInjection / AI-fundermentals
AI fundamentals: GPU architecture, CUDA programming, large-model basics, and AI Agent topics
☆518 Updated this week