Module to automatically maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elastic quotas. Effortless optimization at its finest!
☆683 · Apr 21, 2024 · Updated 2 years ago
Alternatives and similar repositories for nos
Users that are interested in nos are comparing it to the libraries listed below.
- NVIDIA device plugin for Kubernetes ☆49 · Feb 16, 2024 · Updated 2 years ago
- DRA Driver for NVIDIA GPUs ☆662 · Updated this week
- NVIDIA device plugin for Kubernetes ☆3,797 · Updated this week
- GPU Sharing Scheduler for Kubernetes Cluster ☆1,533 · Dec 29, 2023 · Updated 2 years ago
- A collection of libraries to optimise AI model performances ☆8,336 · Jul 22, 2024 · Updated last year
- Kubernetes-native Job Queueing ☆2,593 · Jun 24, 2026 · Updated last week
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes ☆2,752 · Updated this week
- Heterogeneous GPU Sharing on Kubernetes ☆3,623 · Updated this week
- A Cloud Native Batch System (Project under CNCF) ☆5,714 · Updated this week
- AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads ☆202 · Nov 22, 2023 · Updated 2 years ago
- Run Slurm in Kubernetes ☆398 · Updated this week
- Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes ☆5,610 · Jun 23, 2026 · Updated last week
- A kubernetes operator for creating and managing a cache of container images directly on the cluster worker nodes, so application pods sta… ☆1,370 · Feb 20, 2024 · Updated 2 years ago
- MIG Partition Editor for NVIDIA GPUs ☆255 · Jun 23, 2026 · Updated last week
- Kubectl Sockperf plugin - Latency Measurement in Kubernetes ☆22 · Nov 26, 2022 · Updated 3 years ago
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale ☆1,339 · Jun 23, 2026 · Updated last week
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆329 · Jun 23, 2026 · Updated last week
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te… ☆1,213 · Jun 24, 2026 · Updated last week
- A Kubernetes plugin that gives context to what is restarting in your Kubernetes cluster ☆155 · Sep 10, 2025 · Updated 9 months ago
- Repository for out-of-tree scheduler plugins based on scheduler framework. ☆1,297 · Jun 24, 2026 · Updated last week
- Kpad is a simple multiplatform terminal editor born to edit kubernetes declarative manifest yaml files. ☆44 · Oct 10, 2023 · Updated 2 years ago
- Practical GPU Sharing Without Memory Size Constraints ☆313 · Mar 28, 2025 · Updated last year
- Multi-tenancy and policy-based framework for Kubernetes. ☆2,116 · Updated this week
- Multi-cluster Kubernetes usage analytics for CPU, Memory, and GPU: track costs and optimize cluster resources ☆63 · Mar 18, 2025 · Updated last year
- Automatically taint nodes and evict pods based on cpu pressure ☆51 · Dec 23, 2022 · Updated 3 years ago
- Resource-adaptive cluster scheduler for deep learning training. ☆459 · Mar 5, 2023 · Updated 3 years ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.) ☆528 · Updated this week
- ☆900 · Apr 2, 2024 · Updated 2 years ago
- Enable dynamic and seamless Kubernetes multi-cluster topologies ☆1,454 · Jun 24, 2026 · Updated last week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes. ☆79 · Apr 14, 2026 · Updated 2 months ago
- A Kubernetes controller for automatically optimizing pod requests based on their continuous usage. VPA alternative that can work with HPA… ☆207 · Feb 9, 2024 · Updated 2 years ago
- K8s device plugin for GPU sharing ☆100 · May 10, 2023 · Updated 3 years ago
- ☆14 · Jan 11, 2023 · Updated 3 years ago
- A light library to allow changing pod log level without restarting the pod ☆12 · Jul 29, 2023 · Updated 2 years ago
- Cost monitoring for Kubernetes workloads and cloud costs ☆6,608 · Updated this week
- Open, Multi-Cloud, Multi-Cluster Kubernetes Orchestration ☆5,506 · Updated this week
- A Topology-Aware Custom Scheduler For Kubernetes ☆65 · Jul 5, 2023 · Updated 2 years ago
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ cl… ☆10,224 · Updated this week
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models ☆4,755 · Mar 23, 2026 · Updated 3 months ago