💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it a unified cache system with POSIX promise 🎯
☆27Dec 6, 2024Updated last year
Alternatives and similar repositories for Manta
Users that are interested in Manta are comparing it to the libraries listed below.
- Following the same workflows as Kubernetes. Widely used in InftyAI community.☆13May 31, 2026Updated 3 weeks ago
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆306Jan 26, 2026Updated 5 months ago
- WG Serving☆38Mar 24, 2026Updated 3 months ago
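- Schedule of developer events in China (focus: open source, developers, cloud native)☆25Updated this week
- A high-performance proxy component for the Kubernetes APIServer: it proxies the APIServer's List requests, while other request types are reverse-proxied directly to the native APIServer. CKube additionally supports pagination, search, and indexing. CKube is also 100% compatible with native kubectl and ku…☆19Sep 16, 2022Updated 3 years ago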
- An open-source SSH and Kubernetes configuration management tool designed for macOS users.☆43Aug 25, 2025Updated 10 months ago
- 🎉 An awesome & curated list of best LLMOps tools.☆244Updated this week
- The Volcano Descheduler☆24Jan 24, 2025Updated last year
- Kubernetes source code study notes 🔭☆23Apr 5, 2022Updated 4 years ago
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆748Updated this week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆329Updated this week
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆13May 16, 2023Updated 3 years ago
- CPU DRA Driver☆56Updated this week
- data plane testing utility of cloud native☆223Jun 15, 2026Updated 2 weeks ago
- Simplified Data Management and Sharing for Kubernetes☆18Jun 22, 2026Updated last week
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.☆35Updated this week
- Distributed KV cache scheduling & offloading libraries☆157Updated this week
- GaussDB driver and toolkit for Go☆15Dec 17, 2025Updated 6 months ago
- [Moved to https://github.com/kubernetes-sigs/kwok] This is a fake kubelet that can simulate any number of nodes and maintain pods on tho…☆67Jul 20, 2022Updated 3 years ago
- Device plugin for Volcano vGPU which supports hard resource isolation☆161Jun 9, 2026Updated 2 weeks ago
- Gateway API Inference Extension☆699Jun 17, 2026Updated last week
- A collection of good articles organized around the cloud native body of knowledge. For study and reference only.☆52Jul 11, 2022Updated 3 years ago
- A QA system based on k8s-specific knowledge, built on ChatGLM2-6B and served by Ray.☆10Sep 14, 2023Updated 2 years ago
- Prototypes and experiments for WG Device Management.☆16May 21, 2026Updated last month
- Federated middleware based on Karmada☆49Nov 20, 2023Updated 2 years ago
- A workload for deploying LLM inference services on Kubernetes☆243Updated this week
- d.run website☆17Jun 10, 2026Updated 2 weeks ago
- Implements informer-based watching across multiple k8s clusters☆13Nov 25, 2023Updated 2 years ago
- Experimental DRA driver bringing CNI closer to Kubernetes☆44Oct 1, 2025Updated 8 months ago
- Package mount defines an interface to mounting filesystems.☆72Updated this week
- The xline-operator is a powerful tool designed to automate the process of bootstrapping, monitoring, snapshotting, and recovering an xlin…☆16Feb 19, 2024Updated 2 years ago
- Operator for the mutating admission webhook for ClusterResourceOverride☆19Updated this week
- Low level generic controller framework☆59Updated this week
- A collection of useful Go libraries for use with NVIDIA GPU management tools☆56Jun 21, 2026Updated last week
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆27Apr 24, 2025Updated last year
- Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, T…☆470Updated this week
- An Open-source, self-hosted AI model hub with Hugging Face compatibility, accelerating vLLM/SGLang performance.☆252Updated this week
- Production-ready cluster lifecycle management toolchains based on kubespray and other cluster LCM engines.☆525Jun 22, 2026Updated last week
- Layer4 egress gateway for Kubernetes☆299May 29, 2026Updated last month