GPU environment and cluster management with LLM support
☆661May 16, 2024Updated 2 years ago
Alternatives and similar repositories for genv
Users that are interested in genv are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A top-like tool for monitoring GPUs in a cluster☆85Feb 14, 2024Updated 2 years ago
- ☆312May 28, 2026Updated last week
- ☆266Updated this week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆1,293Updated this week
- Tensors, for human consumption☆1,386Apr 9, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ClearML Fractional GPU - Run multiple containers on the same GPU with driver level memory limitation ✨ and compute time-slicing☆93Mar 12, 2026Updated 2 months ago
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ cl…☆10,070Updated this week
- ☆804Apr 28, 2026Updated last month
- HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container☆311Updated this week
- Vendor-agnostic orchestration for training, inference and agentic workloads across NVIDIA, AMD, TPU, and Tenstorrent on clouds, Kubernete…☆2,152Jun 2, 2026Updated last week
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes☆2,723Jun 1, 2026Updated last week
- A Datacenter Scale Distributed Inference Serving Framework☆7,200Updated this week
- Practical GPU Sharing Without Memory Size Constraints☆313Mar 28, 2025Updated last year
- GPUd automates monitoring, diagnostics, and issue identification for GPUs☆482Updated this week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- DRA Driver for NVIDIA GPUs☆651Updated this week
- GPU plugin to the node feature discovery for Kubernetes☆309May 27, 2024Updated 2 years ago
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆27Apr 24, 2025Updated last year
- The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.☆53Updated this week
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆10,733Updated this week
- A JupyterLab extension for displaying dashboards of GPU usage.☆670Updated this week
- ☆12Aug 27, 2024Updated last year
- NVIDIA device plugin for Kubernetes☆3,782Updated this week
- ☆203Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆732Updated this week
- Tools for building GPU clusters☆1,442Jun 2, 2026Updated last week
- Simple, safe way to store and distribute tensors☆3,763Jun 1, 2026Updated last week
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆528Jun 2, 2026Updated last week
- Python client for the Run:ai REST API☆25Dec 15, 2025Updated 5 months ago
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.☆147Nov 21, 2022Updated 3 years ago
- Build and run containers leveraging NVIDIA GPUs☆4,381Jun 2, 2026Updated last week
- GPU Sharing Scheduler for Kubernetes Cluster☆1,533Dec 29, 2023Updated 2 years ago
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te…☆1,203Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A high-throughput and memory-efficient inference and serving engine for LLMs☆81,909Updated this week
- Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes☆5,541Updated this week
- A toolkit to run Ray applications on Kubernetes☆2,534Updated this week
- Containers for machine learning☆9,418Updated this week
- ☆20Apr 12, 2026Updated last month
- Heterogeneous GPU Sharing on Kubernetes☆3,537Updated this week
- Cost-efficient and pluggable Infrastructure components for GenAI inference☆4,846Updated this week