HFAiLab / hai-platform-studioLinks
配合 HAI Platform 使用的集成化用户界面
☆54Updated 2 years ago
Alternatives and similar repositories for hai-platform-studio
Users that are interested in hai-platform-studio are comparing it to the libraries listed below
Sorting:
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆91Updated last year
- 一种任务级GPU算力分时调度的高性能深度学习训练平台☆737Updated 2 years ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆274Updated 6 months ago
- ☆114Updated last year
- OpenAIOS is an incubating open-source distributed OS kernel based on Kubernetes for AI workloads. OpenAIOS-Platform is an AI development…☆99Updated 4 years ago
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆45Updated 2 years ago
- 一站式自动化开源标注平台☆78Updated 3 years ago
- ☆25Updated 2 years ago
- unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆60Updated last year
- ☆123Updated 11 months ago
- ☆183Updated last week
- ☆35Updated 4 years ago
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆154Updated last month
- A CLI for Kubeflow.☆60Updated 2 years ago
- run ChatGLM2-6B in BM1684X☆49Updated last year
- 官方transformers源码解析。AI大模型时代,pytorch、transformer是新操作系统,其他都是运行在其上面的软件。☆17Updated 2 years ago
- 国产加速卡-海光DCU实战(大模型训练、微调、推理 等)☆67Updated 5 months ago
- ElasticCTR,即飞桨弹性计算推荐系统,是基于Kubernetes的企业级推荐系统开源解决方案。该方案融合了百度业务场景下持续打磨的高精度CTR模型、飞桨开源框架的大规模分布式训练能力、工业级稀疏参数弹性调度服务,帮助用户在Kubernetes环境中一键完成推荐系统部…☆187Updated 5 years ago
- Transformer related optimization, including BERT, GPT☆17Updated 2 years ago
- Easy, fast, and cheap pretrain,finetune, serving for everyone☆315Updated 6 months ago
- LLM 推理服务性能测试☆44Updated 2 years ago
- 让算法工程化更简单☆96Updated 10 months ago
- ☆79Updated 2 years ago
- A minimalist benchmarking tool designed to test the routine-generation capabilities of LLMs.☆27Updated last year
- ☆219Updated 2 years ago
- ☆130Updated last year
- Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang☆61Updated last year
- ☆74Updated this week
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆251Updated last year
- A high performance, high expansion, easy to use framework for AI application. 为AI应用的开发者提供一套统一的高性能、易用的编程框架,快速基于AI全栈服务、开发跨端边云的AI行业应用,支持GPU,…☆160Updated last year