makllama / makllamaLinks
MaK(Mac+Kubernetes)llama - Running and orchestrating large language models (LLMs) on Kubernetes with macOS nodes.
β43Updated last year
Alternatives and similar repositories for makllama
Users that are interested in makllama are comparing it to the libraries listed below
Sorting:
- π An awesome & curated list of best LLMOps tools.β167Updated last month
- βΈοΈ Easy, advanced inference platform for large language models on Kubernetes. π Star to support our work!β267Updated this week
- agent-sandbox enables easy management of isolated, stateful, singleton workloads, ideal for use cases like AI agent runtimes.β135Updated last week
- π’ Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! π«β223Updated this week
- A diverse, simple, and secure all-in-one LLMOps platformβ109Updated last year
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.β134Updated this week
- Distributed KV cache coordinatorβ85Updated this week
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Specβ47Updated this week
- π« A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSIβ¦β24Updated 11 months ago
- GPUd automates monitoring, diagnostics, and issue identification for GPUsβ450Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.β71Updated 3 months ago
- This project is designed to simulate GPU information, making it easier to test scenarios where a GPU is not available.β56Updated 8 months ago
- MCP server connecting to Kubernetesβ355Updated this week
- Run native macOS workloads on Kubernetesβ328Updated last month
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprisesβ24Updated 6 months ago
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.β90Updated last month
- β159Updated 3 weeks ago
- This is a landscape of the infrastructure that powers the generative AI ecosystemβ149Updated last year
- NVIDIA vGPU Device Manager manages NVIDIA vGPU devices on top of Kubernetesβ148Updated this week
- β¨Kubewizard is An AI-Agent for automated Kubernetes troubleshooting, and management, based on LangChain and k8s related tools.β27Updated 10 months ago
- A toolkit for discovering cluster network topology.β81Updated this week
- β179Updated this week
- Inference scheduler for llm-dβ103Updated this week
- An Open Source, Cloud-native AI Infrastructure Platform. Not Just GPUs.β51Updated 3 months ago
- π§― Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.β33Updated last week
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)β276Updated 2 years ago
- δΈε½εΌεθ ζ΄»ε¨ζ₯η¨οΌε ³ζ³¨ηΉοΌεΌζΊγεΌεθ γδΊεηοΌβ22Updated this week
- llm-d helm charts and deployment examplesβ46Updated last month
- Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscalingβ86Updated last week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replicationβ611Updated this week