makllama / makllamaLinks
MaK(Mac+Kubernetes)llama - Running and orchestrating large language models (LLMs) on Kubernetes with macOS nodes.
β42Updated last year
Alternatives and similar repositories for makllama
Users that are interested in makllama are comparing it to the libraries listed below
Sorting:
- π An awesome & curated list of best LLMOps tools.β141Updated this week
- A diverse, simple, and secure all-in-one LLMOps platformβ107Updated 10 months ago
- βΈοΈ Easy, advanced inference platform for large language models on Kubernetes. π Star to support our work!β228Updated last week
- π’ Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! π«β198Updated this week
- GPUd automates monitoring, diagnostics, and issue identification for GPUsβ405Updated this week
- A toolkit for discovering cluster network topology.β59Updated last week
- Open Weight, tool-calling LLMsβ154Updated 9 months ago
- Run native macOS workloads on Kubernetesβ302Updated 2 months ago
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)β270Updated last year
- Your AI Kubernetes Expertβ183Updated 2 years ago
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.β69Updated 2 weeks ago
- β231Updated this week
- This is a landscape of the infrastructure that powers the generative AI ecosystemβ148Updated 9 months ago
- β131Updated 2 weeks ago
- MCP server connecting to Kubernetesβ331Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replicationβ526Updated last week
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Specβ32Updated this week
- This project is designed to simulate GPU information, making it easier to test scenarios where a GPU is not available.β50Updated 4 months ago
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.β87Updated this week
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.β142Updated 2 years ago
- Kuberentes LangChain Agent - Interact with Kubernetes Clusters using LLMsβ26Updated 2 years ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.β119Updated this week
- Distributed KV cache coordinatorβ43Updated last week
- NVIDIA vGPU Device Manager manages NVIDIA vGPU devices on top of Kubernetesβ137Updated last week
- πͺΆ Lightweight OpenAI drop-in replacement for Kubernetesβ146Updated last year
- Gateway API Inference Extensionβ415Updated this week
- ποΈ Fine-tune, build, and deploy open-source LLMs easily!β465Updated last week
- β254Updated last month
- β162Updated last week
- GenAI inference performance benchmarking toolβ71Updated this week