llmos-ai / llmos
An Open Source, Cloud-native AI Infrastructure Platform. Not Just GPUs.
⭐51 · Updated 3 months ago
Alternatives and similar repositories for llmos
Users that are interested in llmos are comparing it to the libraries listed below
- An awesome & curated list of best LLMOps tools. ⭐167 · Updated last month
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. Star to support our work! ⭐267 · Updated this week
- A diverse, simple, and secure all-in-one LLMOps platform ⭐109 · Updated last year
- Route LLM requests to the best model for the task at hand. ⭐125 · Updated 2 weeks ago
- Docker Model Runner ⭐218 · Updated last week
- MaK(Mac+Kubernetes)llama - Running and orchestrating large language models (LLMs) on Kubernetes with macOS nodes. ⭐43 · Updated last year
- Open-source MCP Gateway and AI Platform ⭐445 · Updated this week
- A toolkit for discovering cluster network topology. ⭐81 · Updated this week
- Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! ⭐223 · Updated this week
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs. ⭐90 · Updated last month
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second. ⭐215 · Updated 2 months ago
- OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs) ⭐307 · Updated last week
- A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it a unified cache system with POSI… ⭐24 · Updated 11 months ago
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes. ⭐71 · Updated 3 months ago
- A holistic framework to enable the design, development, and evaluation of autonomous AIOps agents. ⭐729 · Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ⭐134 · Updated this week
- Distributed KV cache coordinator ⭐85 · Updated this week
- Self-host LLMs with vLLM and BentoML ⭐156 · Updated 2 weeks ago
- agent-sandbox enables easy management of isolated, stateful, singleton workloads, ideal for use cases like AI agent runtimes. ⭐135 · Updated last week
- ✨ Kubewizard is an AI agent for automated Kubernetes troubleshooting and management, based on LangChain and k8s-related tools. ⭐27 · Updated 10 months ago
- A Site Reliability Engineer AI agent that can monitor application and infrastructure logs, diagnose issues, and report on diagnostics. ⭐108 · Updated last month
- ⭐159 · Updated 3 weeks ago
- Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling ⭐86 · Updated last week
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others) ⭐276 · Updated 2 years ago
- ⭐23 · Updated 2 weeks ago
- Device plugin for Volcano vGPU that supports hard resource isolation ⭐128 · Updated last month
- LM inference server implementation based on *.cpp. ⭐289 · Updated 3 months ago
- MCP server connecting to Kubernetes ⭐355 · Updated this week
- Inference scheduler for llm-d ⭐103 · Updated this week
- This is a landscape of the infrastructure that powers the generative AI ecosystem ⭐149 · Updated last year