llmos-ai / llmosLinks
An Open Source, Cloud-native AI Infrastructure Platform. Not Just GPUs.
β47Updated last month
Alternatives and similar repositories for llmos
Users that are interested in llmos are comparing it to the libraries listed below
Sorting:
- π An awesome & curated list of best LLMOps tools.β154Updated this week
- A diverse, simple, and secure all-in-one LLMOps platformβ108Updated 11 months ago
- βΈοΈ Easy, advanced inference platform for large language models on Kubernetes. π Star to support our work!β252Updated this week
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.β205Updated 3 weeks ago
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.β89Updated this week
- MCP server connecting to Kubernetesβ345Updated last week
- β¨Kubewizard is An AI-Agent for automated Kubernetes troubleshooting, and management, based on LangChain and k8s related tools.β25Updated 8 months ago
- Route LLM requests to the best model for the task at hand.β102Updated this week
- This is a landscape of the infrastructure that powers the generative AI ecosystemβ149Updated 11 months ago
- Containerization and cloud native suite for OPEAβ70Updated 3 weeks ago
- Inference scheduler for llm-dβ87Updated this week
- MCP Server for kubernetes management and diagnose your cluster and applicationsβ25Updated 3 months ago
- ποΈ Fine-tune, build, and deploy open-source LLMs easily!β475Updated this week
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Specβ38Updated this week
- Open-source MCP Gateway and AI Platformβ305Updated this week
- A toolkit for discovering cluster network topology.β67Updated this week
- π’ Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! π«β213Updated last week
- LM inference server implementation based on *.cpp.β274Updated last month
- β144Updated last week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.β125Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replicationβ572Updated this week
- A holistic framework to enable the design, development, and evaluation of autonomous AIOps agents.β682Updated this week
- OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs)β269Updated this week
- Gateway API Inference Extensionβ475Updated this week
- Knowledge for GPTScriptβ29Updated 10 months ago
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.β69Updated last month
- MaK(Mac+Kubernetes)llama - Running and orchestrating large language models (LLMs) on Kubernetes with macOS nodes.β42Updated last year
- Distributed KV cache coordinatorβ68Updated this week
- GPUd automates monitoring, diagnostics, and issue identification for GPUsβ425Updated this week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scaleβ801Updated this week