makllama / makllamaLinks

MaK(Mac+Kubernetes)llama - Running and orchestrating large language models (LLMs) on Kubernetes with macOS nodes.

☆42

Alternatives and similar repositories for makllama

Users that are interested in makllama are comparing it to the libraries listed below

Sorting:

InftyAI / Awesome-LLMOps
🎉 An awesome & curated list of best LLMOps tools.
☆141Updated this week
kubeagi / arcadia
A diverse, simple, and secure all-in-one LLMOps platform
☆107Updated 10 months ago
InftyAI / llmaz
☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
☆228Updated last week
nekomeowww / ollama-operator
🚢 Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫
☆198Updated this week
leptonai / gpud
GPUd automates monitoring, diagnostics, and issue identification for GPUs
☆405Updated this week
NVIDIA / topograph
A toolkit for discovering cluster network topology.
☆59Updated last week
rubra-ai / rubra
Open Weight, tool-calling LLMs
☆154Updated 9 months ago
agoda-com / macOS-vz-kubelet
Run native macOS workloads on Kubernetes
☆302Updated 2 months ago
tensorchord / openmodelz
Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)
☆270Updated last year
knight42 / kopilot
Your AI Kubernetes Expert
☆183Updated 2 years ago
NVIDIA / knavigator
knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.
☆69Updated 2 weeks ago
run-ai / runai-model-streamer
☆231Updated this week
tensorchord / ai-infra-landscape
This is a landscape of the infrastructure that powers the generative AI ecosystem
☆148Updated 9 months ago
run-ai / fake-gpu-operator
☆131Updated 2 weeks ago
strowk / mcp-k8s-go
MCP server connecting to Kubernetes
☆331Updated this week
kubernetes-sigs / lws
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
☆526Updated last week
modelpack / modctl
Command-line tools for managing OCI model artifacts, which are bundled based on Model Spec
☆32Updated this week
chaunceyjiang / fake-gpu
This project is designed to simulate GPU information, making it easier to test scenarios where a GPU is not available.
☆50Updated 4 months ago
llmariner / llmariner
Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.
☆87Updated this week
elastic-ai / elastic-gpu-scheduler
elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.
☆142Updated 2 years ago
jjoneson / k8s-langchain
Kuberentes LangChain Agent - Interact with Kubernetes Clusters using LLMs
☆26Updated 2 years ago
NVIDIA / k8s-nim-operator
An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
☆119Updated this week
llm-d / llm-d-kv-cache-manager
Distributed KV cache coordinator
☆43Updated last week
NVIDIA / vgpu-device-manager
NVIDIA vGPU Device Manager manages NVIDIA vGPU devices on top of Kubernetes
☆137Updated last week
chenhunghan / ialacol
🪶 Lightweight OpenAI drop-in replacement for Kubernetes
☆146Updated last year
kubernetes-sigs / gateway-api-inference-extension
Gateway API Inference Extension
☆415Updated this week
sozercan / aikit
🏗️ Fine-tune, build, and deploy open-source LLMs easily!
☆465Updated last week
cncf-tags / container-device-interface
☆254Updated last month
NVIDIA / nvkind
☆162Updated last week
kubernetes-sigs / inference-perf
GenAI inference performance benchmarking tool
☆71Updated this week