BaizeAI / kcover
🧯 Kubernetes coverage for fault awareness and recovery; works for any LLMOps, MLOps, or AI workloads.
★33 · Updated last week
Alternatives and similar repositories for kcover
Users who are interested in kcover are comparing it to the libraries listed below.
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes. ★72 · Updated 4 months ago
- ★71 · Updated this week
- A simulator of Kubernetes for batch and service workloads. ★50 · Updated 4 years ago
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. Star to support our work! ★270 · Updated last week
- Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling. ★18 · Updated 6 months ago
- A workload for deploying LLM inference services on Kubernetes ★117 · Updated this week
- katalyst aims to provide a universal solution to help improve resource utilization and optimize the overall costs in the cloud. This repo… ★51 · Updated last week
- 🛫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it a unified cache system with POSI… ★24 · Updated 11 months ago
- ★122 · Updated 3 years ago
- Distributed KV cache coordinator ★88 · Updated this week
- Device plugins for Volcano, e.g. GPU ★129 · Updated 8 months ago
- Kubernetes Container Runtime Interface proxy service with hardware resource aware workload placement policies ★178 · Updated 4 months ago
- A federation scheduler for multi-cluster ★58 · Updated 3 weeks ago
- The Volcano Descheduler ★21 · Updated 10 months ago
- Inference scheduler for llm-d ★106 · Updated this week
- Device plugin for volcano vgpu that supports hard resource isolation ★131 · Updated 2 months ago
- ★168 · Updated last month
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling. ★144 · Updated 3 years ago
- This project is designed to simulate GPU information, making it easier to test scenarios where a GPU is not available. ★56 · Updated 8 months ago
- [Moved to https://github.com/kubernetes-sigs/kwok] This is a fake kubelet that can simulate any number of nodes and maintain pods on tho… ★65 · Updated 3 years ago
- The API (CRD) of Volcano ★49 · Updated last week
- A distributed system for Agentic AI ★32 · Updated last week
- [Moved to https://github.com/kubernetes-sigs/kwok] fake-k8s is a tool for running fake Kubernetes clusters. It can be used as an alternat… ★19 · Updated 2 years ago
- Example DRA driver that developers can fork and modify to get them started writing their own. ★109 · Updated last month
- Go Abstraction for Allocating NVIDIA GPUs with Custom Policies ★119 · Updated last week
- Following the same workflows as Kubernetes. Widely used in InftyAI community. ★13 · Updated 4 months ago
- ★32 · Updated 4 years ago
- HTTP based Tree-shaped Peer2Peer blob transfer proxy, distributing images or blob data. ★25 · Updated 3 years ago
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i… ★12 · Updated 2 years ago
- d.run website ★15 · Updated last week