GPU environment and cluster management with LLM support
☆658May 16, 2024Updated 2 years ago
Alternatives and similar repositories for genv
Users that are interested in genv are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A top-like tool for monitoring GPUs in a cluster☆85Feb 14, 2024Updated 2 years ago
- GPU Environment Management for Visual Studio Code☆39Jul 19, 2023Updated 2 years ago
- ☆318Updated this week
- ☆280Jun 17, 2026Updated last week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆1,339Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Tensors, for human consumption☆1,388Apr 9, 2026Updated 2 months ago
- ClearML Fractional GPU - Run multiple containers on the same GPU with driver level memory limitation ✨ and compute time-slicing☆93Mar 12, 2026Updated 3 months ago
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ cl…☆10,224Updated this week
- ☆807Apr 28, 2026Updated 2 months ago
- HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container☆313Jun 16, 2026Updated 2 weeks ago
- Vendor-agnostic orchestration for training, inference and agentic workloads across NVIDIA, AMD, TPU, and Tenstorrent on clouds, Kubernete…☆2,168Updated this week
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes☆2,752Updated this week
- A Datacenter Scale Distributed Inference Serving Framework☆7,352Updated this week
- Practical GPU Sharing Without Memory Size Constraints☆313Mar 28, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- GPUd automates monitoring, diagnostics, and issue identification for GPUs☆484Updated this week
- DRA Driver for NVIDIA GPUs☆662Updated this week
- GPU plugin to the node feature discovery for Kubernetes☆309May 27, 2024Updated 2 years ago
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆27Apr 24, 2025Updated last year
- The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.☆54Updated this week
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆10,776Updated this week
- A JupyterLab extension for displaying dashboards of GPU usage.☆675Updated this week
- ☆12Aug 27, 2024Updated last year
- NVIDIA device plugin for Kubernetes☆3,797Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆206Jun 21, 2026Updated last week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆748Updated this week
- Tools for building GPU clusters☆1,446Jun 2, 2026Updated 3 weeks ago
- Self-hosted huggingface mirror service. 自建huggingface镜像服务。☆246Mar 14, 2026Updated 3 months ago
- Simple, safe way to store and distribute tensors☆3,790Jun 19, 2026Updated last week
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆528Updated this week
- Python client for the Run:ai REST API☆25Dec 15, 2025Updated 6 months ago
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling.☆146Nov 21, 2022Updated 3 years ago
- Build and run containers leveraging NVIDIA GPUs☆4,440Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- GPU Sharing Scheduler for Kubernetes Cluster☆1,533Dec 29, 2023Updated 2 years ago
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te…☆1,213Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆83,677Updated this week
- Rust crates for XetHub☆87Oct 16, 2024Updated last year
- Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes☆5,610Updated this week
- A toolkit to run Ray applications on Kubernetes☆2,562Updated this week
- Combo.jl: Combinatorial Optimization in Julia☆16Dec 18, 2019Updated 6 years ago