A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deployment.
☆236 · May 15, 2026 · Updated last week
Alternatives and similar repositories for nim-deploy
Users that are interested in nim-deploy are comparing it to the libraries listed below.
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench ☆237 · May 1, 2025 · Updated last year
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆158 · May 13, 2026 · Updated last week
- 🚀 Use NVIDIA NIMs with Haystack pipelines ☆32 · Sep 4, 2024 · Updated last year
- ☆60 · Feb 5, 2026 · Updated 3 months ago
- Source Code and Usage Samples for the Resources hosted in the NVIDIA AI Enterprise AzureML Registry ☆21 · Aug 7, 2024 · Updated last year
- Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture. ☆4,023 · Mar 30, 2026 · Updated last month
- Project demonstrates the power and simplicity of NVIDIA NIM (NVIDIA Inference Model), a suite of optimized cloud-native microservices, by… ☆16 · Mar 21, 2024 · Updated 2 years ago
- ☆11 · Mar 16, 2026 · Updated 2 months ago
- ☆12 · Dec 20, 2025 · Updated 5 months ago
- Re-scoring a set of docked ligands with off-the-shelf algorithms to assess utility in virtual screening ☆11 · Oct 13, 2021 · Updated 4 years ago
- HyDE based RAG using NVIDIA NIM. ☆16 · Mar 20, 2024 · Updated 2 years ago
- Easy to use python wrapper around Deepstream Python bindings ☆14 · Oct 11, 2021 · Updated 4 years ago
- ☆20 · Mar 11, 2026 · Updated 2 months ago
- ☆40 · Mar 13, 2026 · Updated 2 months ago
- MIG Partition Editor for NVIDIA GPUs ☆252 · Updated this week
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud. ☆132 · Updated this week
- Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inferen… ☆74 · May 8, 2026 · Updated 2 weeks ago
- Implementation of SCINS ☆16 · Nov 6, 2024 · Updated last year
- Kubernetes Operator, Helm Charts, Ansible Playbooks, and utility scripts for large-scale AIStore deployments on Kubernetes. ☆132 · Updated this week
- NVIDIA integrations with LangChain ☆200 · May 13, 2026 · Updated last week
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes ☆2,701 · May 14, 2026 · Updated last week
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing ☆12 · Apr 1, 2020 · Updated 6 years ago
- Tools to deploy GPU clusters in the Cloud ☆34 · Apr 4, 2023 · Updated 3 years ago
- Plugins for Sonobuoy ☆62 · May 20, 2025 · Updated last year
- Create, List, Update, Delete Amazon EKS clusters. Deploy and manage software on EKS. Run distributed model training and inference example… ☆66 · May 15, 2026 · Updated last week
- An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG) ☆368 · Aug 12, 2025 · Updated 9 months ago
- WG Serving ☆35 · Mar 24, 2026 · Updated last month
- ☆15 · Apr 13, 2026 · Updated last month
- The NVIDIA NeMo Agent Toolkit UI streamlines interacting with NeMo Agent Toolkit workflows in an easy-to-use web application. ☆99 · May 15, 2026 · Updated last week
- This repository contains the results and code for the MLPerf™ Training v3.0 benchmark. ☆12 · Aug 10, 2023 · Updated 2 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆26 · May 4, 2026 · Updated 2 weeks ago
- ☆14 · May 29, 2024 · Updated last year
- A Datacenter Scale Distributed Inference Serving Framework ☆6,941 · Updated this week
- GPU Environment Management for JupyterLab ☆26 · Feb 19, 2024 · Updated 2 years ago
- Graph-based learning in Python ☆17 · Mar 9, 2018 · Updated 8 years ago
- ☆25 · Apr 4, 2026 · Updated last month
- This repository contains tutorials and examples for Triton Inference Server ☆838 · May 8, 2026 · Updated 2 weeks ago
- ☆27 · Aug 4, 2025 · Updated 9 months ago
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat… ☆13,669 · Updated this week