volcano-sh / kthenaLinks
β57Updated this week
Alternatives and similar repositories for kthena
Users that are interested in kthena are comparing it to the libraries listed below
Sorting:
- π« A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSIβ¦β24Updated 11 months ago
- Distributed KV cache coordinatorβ82Updated last week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.β70Updated 3 months ago
- π§― Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.β33Updated 3 weeks ago
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Specβ45Updated this week
- agent-sandbox enables easy management of isolated, stateful, singleton workloads, ideal for use cases like AI agent runtimes.β126Updated last week
- β17Updated last week
- Inference scheduler for llm-dβ102Updated this week
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprisesβ24Updated 6 months ago
- Example DRA driver that developers can fork and modify to get them started writing their own.β96Updated last week
- βΈοΈ Easy, advanced inference platform for large language models on Kubernetes. π Star to support our work!β263Updated last week
- Incubating P/D sidecar for llm-dβ16Updated this week
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, iβ¦β12Updated 2 years ago
- A distributed system for Elastic Workloadβ31Updated last month
- A simulator of Kuberntes for batch and service workload.β49Updated 4 years ago
- DRANET is a Kubernetes Network Driver that uses Dynamic Resource Allocation (DRA) to deliver high-performance networking for demanding apβ¦β128Updated last week
- A workload for deploying LLM inference services on Kubernetesβ93Updated this week
- An OCM addon that automates the installation of Kubernetes' konnectivity servers and agents.β49Updated last week
- d.run websiteβ15Updated this week
- GenAI inference performance benchmarking toolβ107Updated last week
- The Volcano Deschedulerβ20Updated 9 months ago
- Dragonfly Helm Chartsβ41Updated last week
- API Extensions for core KubeVela.β14Updated 3 weeks ago
- ControllerMesh is a solution that helps developers manage their controllers/operators better with enhanced isolation.β63Updated 2 years ago
- Manage K8S like managing local filesβ29Updated 2 years ago
- Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling.β18Updated 5 months ago
- Canonical location of the Dragonfly API definitionβ12Updated last week
- Holistic job manager on Kubernetesβ116Updated last year
- β17Updated 4 months ago
- katalyst aims to provide a universal solution to help improve resource utilization and optimize the overall costs in the cloud. This repoβ¦β50Updated 3 weeks ago