A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deployment.
☆224Feb 21, 2026Updated last week
Alternatives and similar repositories for nim-deploy
Users who are interested in nim-deploy are comparing it to the repositories listed below
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆150Updated this week
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench☆209May 1, 2025Updated 10 months ago
- 🚀 Use NVIDIA NIMs with Haystack pipelines☆32Sep 4, 2024Updated last year
- ☆57Feb 5, 2026Updated 3 weeks ago
- Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native.☆511Apr 18, 2025Updated 10 months ago
- HyDE based RAG using NVIDIA NIM.☆16Mar 20, 2024Updated last year
- NVIDIA DRA Driver for GPUs☆574Updated this week
- Run cloud native workloads on NVIDIA GPUs☆228Jan 22, 2026Updated last month
- Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.☆3,816Feb 5, 2026Updated 3 weeks ago
- Performance tests for multinode NGC.Ready certification☆15Jan 28, 2026Updated last month
- MIG Partition Editor for NVIDIA GPUs☆241Updated this week
- Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inferen…☆73Updated this week
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆11Apr 1, 2020Updated 5 years ago
- Comprehensive, scalable ML inference architecture using Amazon EKS, leveraging Graviton processors for cost-effective CPU-based inference…☆21Feb 14, 2026Updated 2 weeks ago
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.☆129Feb 24, 2026Updated last week
- Configures and builds a database for engagement events generated by Amazon Simple Email Service (SES) and Amazon Pinpoint engagements usi…☆13Jan 16, 2025Updated last year
- This repository contains the results and code for the MLPerf™ Training v3.0 benchmark.☆12Aug 10, 2023Updated 2 years ago
- WG Serving☆34Dec 15, 2025Updated 2 months ago
- markdown docs☆94Feb 1, 2026Updated last month
- Inference API server with echo and gRPC to triton server (golang)☆13Nov 16, 2022Updated 3 years ago
- ☆67Mar 28, 2025Updated 11 months ago
- NVIDIA Fleet Command is a hybrid-cloud platform for securely and remotely deploying, managing, and scaling AI across dozens or up to thou…☆14Jul 20, 2022Updated 3 years ago
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆673Updated this week
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes☆2,572Updated this week
- ☆18Aug 19, 2024Updated last year
- Visual Studio Code Extension without Terraform Cloud and Telemetry☆18Nov 18, 2025Updated 3 months ago
- Experimenting text-embeddings-inference server on both CPU and GPU☆18Oct 25, 2023Updated 2 years ago
- This repository contains code and datasets for our paper on the effects of document multiplicity while the context size is fixed in Retri…☆18Mar 13, 2025Updated 11 months ago
- Graph-based learning in Python☆17Mar 9, 2018Updated 7 years ago
- A collection of Ansible assets for use with Ansible-based operators built with the operator-sdk.☆20Oct 30, 2023Updated 2 years ago
- Gateway API Inference Extension☆597Updated this week
- The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.☆1,842Updated this week
- ☆189Updated this week
- ☆286Updated this week
- Linux-CAN / SocketCAN documentation☆27Feb 8, 2024Updated 2 years ago
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat…☆12,938Updated this week
- This repository demonstrates an application built with Java 21 + Spring Boot 3 + MyBatis including CRUD operations, authentication, rout…☆12Dec 1, 2024Updated last year
- ☆31Feb 18, 2026Updated last week
- Volcano website and documentation repo: https://volcano.sh☆30Feb 5, 2026Updated 3 weeks ago