run-ai / rntop

A top-like tool for monitoring GPUs in a cluster

☆85

Alternatives and similar repositories for rntop

Users who are interested in rntop are comparing it to the repositories listed below.

mlcommons / mlcube
MLCube® is a project that reduces friction for machine learning by ensuring that models are easily portable and reproducible.
☆157 Updated last week
kserve / open-inference-protocol
Repository for open inference protocol specification
☆60 Updated 6 months ago
NVIDIA / ais-k8s
Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.
☆113 Updated last week
run-ai / docs
markdown docs
☆92 Updated last month
run-ai / genv
GPU environment and cluster management with LLM support
☆654 Updated last year
run-ai / vscode-genv
GPU Environment Management for Visual Studio Code
☆39 Updated 2 years ago
coreweave / ml-containers
☆42 Updated this week
coreweave / tensorizer
Module, Model, and Tensor Serialization/Deserialization
☆276 Updated 3 months ago
NVIDIA / pyxis
Container plugin for Slurm Workload Manager
☆398 Updated 3 weeks ago
clearml / clearml-fractional-gpu
ClearML Fractional GPU - Run multiple containers on the same GPU with driver level memory limitation ✨ and compute time-slicing
☆85 Updated 3 weeks ago
ray-project / mlflow-ray-serve
MLFlow Deployment Plugin for Ray Serve
☆46 Updated 3 years ago
kserve / modelmesh-serving
Controller for ModelMesh
☆242 Updated 5 months ago
triton-inference-server / model_navigator
Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.
☆213 Updated 7 months ago
kserve / modelmesh
Distributed Model Serving Framework
☆179 Updated 2 months ago
NVIDIA / cloud-native-stack
Run cloud native workloads on NVIDIA GPUs
☆208 Updated last month
coreweave / kubernetes-cloud
Getting Started with the CoreWeave Kubernetes GPU Cloud
☆79 Updated 5 months ago
k3ai / k3ai
A lightweight tool to get an AI infrastructure stack up in minutes, not days. K3ai will take care of setting up K8s for you, deploy the AI tool…
☆124 Updated 3 years ago
determined-ai / works-with-determined
This repository contains example integrations between Determined and other ML products
☆48 Updated last year
meta-pytorch / torchx
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…
☆402 Updated last week
lambdal / lambda-stack-dockerfiles
☆282 Updated 8 months ago
triton-inference-server / pytorch_backend
The Triton backend for the PyTorch TorchScript models.
☆166 Updated last week
terrytangyuan / awesome-kubeflow
A curated list of awesome projects and resources related to Kubeflow (a CNCF incubating project)
☆220 Updated last month
chassisml / chassis
Chassis turns machine learning models into portable container images that can run just about anywhere.
☆86 Updated last year
NVIDIA / nephele
Tools to deploy GPU clusters in the Cloud
☆33 Updated 2 years ago
run-ai / runai-model-streamer
☆267 Updated last week
mlflow / mlflow-torchserve
Plugin for deploying MLflow models to TorchServe
☆110 Updated 2 years ago
ray-project / kuberay-helm
Helm charts for the KubeRay project
☆58 Updated last week
mlspec / MLSpec
Unified specification for defining and executing ML workflows, making reproducibility, consistency, and governance easier across the ML p…
☆94 Updated last year
IBM / text-generation-inference
IBM development fork of https://github.com/huggingface/text-generation-inference
☆62 Updated 2 months ago
clearml / clearml-helm-charts
Helm chart repository for the new unified way to deploy ClearML on Kubernetes. ClearML - Auto-Magical CI/CD to streamline your AI workloa…
☆43 Updated 3 months ago