GPU environment and cluster management with LLM support
☆659May 16, 2024Updated last year
Alternatives and similar repositories for genv
Users that are interested in genv are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A top-like tool for monitoring GPUs in a cluster☆85Feb 14, 2024Updated 2 years ago
- GPU Environment Management for Visual Studio Code☆39Jul 19, 2023Updated 2 years ago
- ☆297Mar 19, 2026Updated last month
- markdown docs☆96Feb 1, 2026Updated 2 months ago
- ☆243Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆1,233Updated this week
- GPU Environment Management for JupyterLab☆26Feb 19, 2024Updated 2 years ago
- Tensors, for human consumption☆1,384Apr 9, 2026Updated last week
- ClearML Fractional GPU - Run multiple containers on the same GPU with driver level memory limitation ✨ and compute time-slicing☆90Mar 12, 2026Updated last month
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ cl…☆9,866Updated this week
- ☆799Mar 23, 2026Updated 3 weeks ago
- HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container☆295Updated this week
- Control plane for agents and engineers to provision compute and run training and inference across NVIDIA, AMD, TPU, and Tenstorrent GPUs—…☆2,090Updated this week
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes☆2,635Apr 10, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Practical GPU Sharing Without Memory Size Constraints☆308Mar 28, 2025Updated last year
- A Datacenter Scale Distributed Inference Serving Framework☆6,570Updated this week
- GPUd automates monitoring, diagnostics, and issue identification for GPUs☆481Updated this week
- DRA Driver for NVIDIA GPUs☆626Updated this week
- GPU plugin to the node feature discovery for Kubernetes☆307May 27, 2024Updated last year
- The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.☆52Updated this week
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆26Apr 24, 2025Updated 11 months ago
- A JupyterLab extension for displaying dashboards of GPU usage.☆668Feb 23, 2026Updated last month
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆10,573Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Aug 27, 2024Updated last year
- NVIDIA device plugin for Kubernetes☆3,723Updated this week
- ☆194Jan 20, 2026Updated 3 months ago
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆697Updated this week
- Self-hosted huggingface mirror service. 自建huggingface镜像服务。☆227Mar 14, 2026Updated last month
- Tools for building GPU clusters☆1,430Feb 23, 2026Updated last month
- Simple, safe way to store and distribute tensors☆3,711Updated this week
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆524Updated this week
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.☆145Nov 21, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Build and run containers leveraging NVIDIA GPUs☆4,252Updated this week
- Python client for the Run:ai REST API☆24Dec 15, 2025Updated 4 months ago
- GPU Sharing Scheduler for Kubernetes Cluster☆1,531Dec 29, 2023Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆76,536Updated this week
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te…☆1,186Mar 31, 2026Updated 2 weeks ago
- A toolkit to run Ray applications on Kubernetes☆2,448Updated this week
- Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes☆5,332Updated this week