ModelEngine-Group / flexaiLinks
☆30Updated last week
Alternatives and similar repositories for flexai
Users that are interested in flexai are comparing it to the libraries listed below
Sorting:
- HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container☆269Updated last week
- tony k8s device-plugin,一个简单的 k8s device-plugin 实现以及原理分析教程。☆29Updated 10 months ago
- ☆538Updated last year
- ☆86Updated last week
- ☆891Updated last year
- The IX device plugin is a DaemonSet for Kubernetes, which can help to expose the Iluvatar GPU in the Kubernetes cluster.☆17Updated 2 months ago
- ☆331Updated last week
- a unified scheduler for online and offline tasks☆639Updated 10 months ago
- Heterogeneous AI Computing Virtualization Middleware(Project under CNCF)☆2,945Updated this week
- A workload for deploying LLM inference services on Kubernetes☆160Updated last week
- AI 基础知识 - GPU 架构、CUDA 编程、大模型基础及AI Agent 相关知识☆742Updated last week
- A federation scheduler for multi-cluster☆61Updated this week
- An open-source kit for agent development, integrated the powerful capabilities of Volcengine.☆256Updated this week
- Arks is a cloud-native inference framework running on Kubernetes☆45Updated 2 weeks ago
- Continue to follow kubernetes source code and analyze the source code implementation☆149Updated 3 years ago
- OpenAIOS vGPU device plugin for Kubernetes is originated from the OpenAIOS project to virtualize GPU device memory, in order to allow app…☆584Updated last year
- Device plugins for Volcano, e.g. GPU☆131Updated 10 months ago
- Rapid and cost-effective operator and best practice for agent sandbox lifecycle management.☆88Updated this week
- AI Cluster Observability & Troubleshooting Toolkit. Powered by SII & Infrawaves.☆32Updated this week
- Kubernetes Operator for AI and Bigdata Elastic Training☆90Updated last year
- ☆60Updated last month
- kubernetes device plugin的开发示例☆36Updated 5 years ago
- Persist and reuse KV Cache to speedup your LLM.☆244Updated this week
- vLLM Kunlun (vllm-kunlun) is a community-maintained hardware plugin designed to seamlessly run vLLM on the Kunlun XPU.☆239Updated this week
- Using CRDs to manage GPU resources in Kubernetes.☆210Updated 3 years ago
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆654Updated last week
- ☆106Updated last week
- Provides deploy scripts and CSI for Lustre.☆14Updated 3 months ago
- Hooked CUDA-related dynamic libraries by using automated code generation tools.☆172Updated 2 years ago
- Run your deep learning workloads on Kubernetes more easily and efficiently.☆532Updated last year