llmos-ai / llmosLinks
An Open Source, Cloud-native AI Infrastructure Platform. Not Just GPUs.
β50Updated 2 months ago
Alternatives and similar repositories for llmos
Users that are interested in llmos are comparing it to the libraries listed below
Sorting:
- π An awesome & curated list of best LLMOps tools.β165Updated last week
- A diverse, simple, and secure all-in-one LLMOps platformβ108Updated last year
- βΈοΈ Easy, advanced inference platform for large language models on Kubernetes. π Star to support our work!β261Updated last week
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.β211Updated 2 months ago
- β¨Kubewizard is An AI-Agent for automated Kubernetes troubleshooting, and management, based on LangChain and k8s related tools.β27Updated 9 months ago
- LM inference server implementation based on *.cpp.β286Updated 2 months ago
- A holistic framework to enable the design, development, and evaluation of autonomous AIOps agents.β712Updated last week
- Route LLM requests to the best model for the task at hand.β113Updated last month
- π’ Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! π«β219Updated last week
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.β90Updated 2 weeks ago
- Run AI generated code in isolated sandboxesβ112Updated 8 months ago
- Docker Model Runnerβ192Updated this week
- Containerization and cloud native suite for OPEAβ70Updated 2 weeks ago
- ποΈ Fine-tune, build, and deploy open-source LLMs easily!β483Updated this week
- Device-plugin for volcano vgpu which support hard resource isolationβ113Updated last month
- ClearML Fractional GPU - Run multiple containers on the same GPU with driver level memory limitation β¨ and compute time-slicingβ81Updated last year
- β152Updated this week
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Specβ45Updated this week
- Inference scheduler for llm-dβ99Updated last week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scaleβ869Updated this week
- Chat to deploy and manage applications on any infrastructureβ126Updated last year
- Open-source MCP Gateway and AI Platformβ413Updated last week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.β70Updated 3 months ago
- MCP server connecting to Kubernetesβ353Updated last month
- Distributed KV cache coordinatorβ79Updated last week
- A Site Reliability Engineer AI agent that can monitor application and infrastructure logs, diagnose issues, and report on diagnostics.β104Updated 3 weeks ago
- Self-host LLMs with vLLM and BentoMLβ152Updated 2 weeks ago
- NVIDIA vGPU Device Manager manages NVIDIA vGPU devices on top of Kubernetesβ144Updated this week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.β130Updated this week
- MCP Server for kubernetes management and diagnose your cluster and applicationsβ25Updated 5 months ago