llmos-ai / llmosLinks

An Open Source, Cloud-native AI Infrastructure Platform. Not Just GPUs.

☆45

Alternatives and similar repositories for llmos

Users that are interested in llmos are comparing it to the libraries listed below

Sorting:

InftyAI / Awesome-LLMOps
🎉 An awesome & curated list of best LLMOps tools.
☆141Updated this week
kubeagi / arcadia
A diverse, simple, and secure all-in-one LLMOps platform
☆107Updated 10 months ago
InftyAI / llmaz
☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
☆228Updated last week
gpustack / gguf-parser-go
Review/Check GGUF files and estimate the memory usage and maximum tokens per second.
☆189Updated last week
llmariner / llmariner
Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.
☆87Updated this week
NVIDIA / topograph
A toolkit for discovering cluster network topology.
☆59Updated last week
nekomeowww / ollama-operator
🚢 Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫
☆198Updated this week
sozercan / aikit
🏗️ Fine-tune, build, and deploy open-source LLMs easily!
☆465Updated last week
NVIDIA-AI-Blueprints / llm-router
Route LLM requests to the best model for the task at hand.
☆87Updated last month
microsoft / AIOpsLab
A holistic framework to enable the design, development, and evaluation of autonomous AIOps agents.
☆647Updated this week
qingwave / kubewizard
✨Kubewizard is An AI-Agent for automated Kubernetes troubleshooting, and management, based on LangChain and k8s related tools.
☆24Updated 6 months ago
gptscript-ai / knowledge
Knowledge for GPTScript
☆29Updated 9 months ago
NVIDIA / k8s-nim-operator
An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
☆119Updated this week
llm-d / llm-d-inference-scheduler
Inference scheduler for llm-d
☆68Updated last week
strowk / mcp-k8s-go
MCP server connecting to Kubernetes
☆331Updated this week
gpustack / llama-box
LM inference server implementation based on *.cpp.
☆242Updated this week
kubernetes-sigs / lws
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
☆526Updated this week
run-ai / runai-model-streamer
☆231Updated this week
NVIDIA / KAI-Scheduler
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
☆723Updated this week
obot-platform / obot
Open source AI Agent Platform
☆198Updated this week
opea-project / GenAIInfra
Containerization and cloud native suite for OPEA
☆70Updated this week
tensorchord / ai-infra-landscape
This is a landscape of the infrastructure that powers the generative AI ecosystem
☆148Updated 9 months ago
llm-d / llm-d-kv-cache-manager
Distributed KV cache coordinator
☆43Updated last week
NVIDIA / knavigator
knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.
☆69Updated 2 weeks ago
wenhuwang / mcp-k8s-eye
MCP Server for kubernetes management and diagnose your cluster and applications
☆23Updated 2 months ago
bentoml / BentoVLLM
Self-host LLMs with vLLM and BentoML
☆138Updated last week
run-ai / fake-gpu-operator
☆131Updated 2 weeks ago
Project-HAMi / volcano-vgpu-device-plugin
Device-plugin for volcano vgpu which support hard resource isolation
☆96Updated last month
InftyAI / Manta
💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…
☆24Updated 7 months ago
kubernetes-sigs / gateway-api-inference-extension
Gateway API Inference Extension
☆415Updated this week