InftyAI / llmazLinks

☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!

☆273

Alternatives and similar repositories for llmaz

Users that are interested in llmaz are comparing it to the libraries listed below

Sorting:

volcano-sh / volcano-global
A federation scheduler for multi-cluster
☆58Updated last month
Project-HAMi / volcano-vgpu-device-plugin
Device-plugin for volcano vgpu which support hard resource isolation
☆131Updated last week
kubernetes-sigs / lws
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
☆620Updated last week
volcano-sh / kthena
☆79Updated this week
volcano-sh / devices
Device plugins for Volcano, e.g. GPU
☆129Updated 8 months ago
kubernetes-sigs / gateway-api-inference-extension
Gateway API Inference Extension
☆537Updated this week
run-ai / fake-gpu-operator
☆172Updated this week
InftyAI / Manta
💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…
☆24Updated last year
sgl-project / rbg
A workload for deploying LLM inference services on Kubernetes
☆123Updated this week
sgl-project / ome
OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs)
☆324Updated this week
NVIDIA / knavigator
knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.
☆72Updated 4 months ago
Project-HAMi / HAMi-core
HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container
☆257Updated last week
DataTunerX / datatunerx
Large language model fine-tuning capabilities based on cloud native and distributed computing.
☆92Updated last year
BaizeAI / kcover
🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.
☆33Updated last week
InftyAI / Awesome-LLMOps
🎉 An awesome & curated list of best LLMOps tools.
☆172Updated last week
llm-d / llm-d-inference-scheduler
Inference scheduler for llm-d
☆109Updated last week
ai-dynamo / grove
Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling
☆119Updated this week
kubernetes-sigs / jobset
JobSet: a k8s native API for distributed ML training and HPC workloads
☆286Updated last week
NVIDIA / k8s-dra-driver-gpu
NVIDIA DRA Driver for GPUs
☆504Updated this week
elastic-ai / elastic-gpu
Using CRDs to manage GPU resources in Kubernetes.
☆209Updated 3 years ago
volcano-sh / descheduler
The Volcano Descheduler
☆21Updated 10 months ago
volcano-sh / apis
The API (CRD) of Volcano
☆49Updated 2 weeks ago
kubernetes-sigs / agent-sandbox
agent-sandbox enables easy management of isolated, stateful, singleton workloads, ideal for use cases like AI agent runtimes.
☆443Updated this week
kosmos-io / kosmos
The limitless expansion of Kubernetes. Make Kubernetes without boundaries
☆256Updated 5 months ago
llm-d / llm-d-kv-cache-manager
Distributed KV cache coordinator
☆91Updated last week
Mellanox / k8s-rdma-shared-dev-plugin
☆315Updated last week
leptonai / gpud
GPUd automates monitoring, diagnostics, and issue identification for GPUs
☆461Updated this week
llm-d-incubation / llm-d-infra
llm-d helm charts and deployment examples
☆47Updated 2 weeks ago
kubewharf / godel-scheduler
a unified scheduler for online and offline tasks
☆632Updated 8 months ago
kube-queue / kube-queue
☆122Updated 3 years ago