llmos-ai / llmosLinks
An Open Source, Cloud-native AI Infrastructure Platform. Not Just GPUs.
☆54Updated 5 months ago
Alternatives and similar repositories for llmos
Users that are interested in llmos are comparing it to the libraries listed below
Sorting:
- A diverse, simple, and secure all-in-one LLMOps platform☆109Updated last year
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆286Updated last week
- 🎉 An awesome & curated list of best LLMOps tools.☆184Updated last week
- MaK(Mac+Kubernetes)llama - Running and orchestrating large language models (LLMs) on Kubernetes with macOS nodes.☆43Updated last year
- Route LLM requests to the best model for the task at hand.☆166Updated last week
- agent-sandbox enables easy management of isolated, stateful, singleton workloads, ideal for use cases like AI agent runtimes.☆702Updated last week
- MCP server connecting to Kubernetes☆368Updated 3 weeks ago
- ClearML Fractional GPU - Run multiple containers on the same GPU with driver level memory limitation ✨ and compute time-slicing☆88Updated 2 months ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆232Updated 2 weeks ago
- Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling☆143Updated this week
- LM inference server implementation based on *.cpp.☆294Updated last month
- InferX: Inference as a Service Platform☆146Updated this week
- Distributed KV cache scheduling & offloading libraries☆98Updated this week
- ☆195Updated this week
- Kubernetes-native AI serving platform for scalable model serving.☆168Updated this week
- Run AI generated code in isolated sandboxes☆137Updated 11 months ago
- ✨Kubewizard is An AI-Agent for automated Kubernetes troubleshooting, and management, based on LangChain and k8s related tools.☆30Updated last year
- Complete MCP Platform -- Hosting, Registry, Gateway, and Chat Client☆562Updated this week
- 🏗️ Fine-tune, build, and deploy open-source LLMs easily!☆503Updated last week
- A toolkit for discovering cluster network topology.☆90Updated this week
- Inference scheduler for llm-d☆120Updated this week
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.☆92Updated 3 months ago
- NVIDIA vGPU Device Manager manages NVIDIA vGPU devices on top of Kubernetes☆154Updated this week
- A modern web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports both GPU and CPU modes…☆331Updated this week
- Device-plugin for volcano vgpu which support hard resource isolation☆142Updated 3 weeks ago
- Go Abstraction for Allocating NVIDIA GPUs with Custom Policies☆120Updated last month
- ☆274Updated this week
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…☆25Updated last year
- This is a landscape of the infrastructure that powers the generative AI ecosystem☆152Updated last year
- Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, T…☆356Updated this week