GoogleCloudPlatform / nvidia-nemo-on-gke
Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine
☆12Updated 2 months ago
Alternatives and similar repositories for nvidia-nemo-on-gke:
Users that are interested in nvidia-nemo-on-gke are comparing it to the libraries listed below
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆30Updated this week
- ☆39Updated 4 months ago
- Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)☆64Updated last month
- ☆14Updated 3 months ago
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆96Updated this week
- ☆24Updated this week
- ☆66Updated 6 months ago
- JupyterLab extension to provide a Kubeflow specific left area for Notebooks deployment☆18Updated 4 years ago
- ☆14Updated this week
- Infrastructure as code for GPU accelerated managed Kubernetes clusters.☆49Updated last month
- Google Cloud Product Cataloging Solution using Generative AI☆41Updated last week
- This repository compiles prescriptive guidance and code samples demonstrating how to operationalize Google Research T5X framework on Goog…☆51Updated 8 months ago
- A simplified and automated orchestration workflow to perform ML end-to-end (E2E) model tests and benchmarking on Cloud VMs across differe…☆31Updated this week
- A direct Google Cloud Storage integration for PyTorch☆33Updated 3 weeks ago
- This repository compiles prescriptive guidance and code samples for the operationalization of NVIDIA Merlin framework on Google Cloud Ver…☆34Updated 2 years ago
- Machine Learning Inference Graph Spec☆21Updated 5 years ago
- A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen…☆151Updated last week
- Ray-based Apache Beam runner☆43Updated last year
- Unified runtime-adapter image of the sidecar containers which run in the modelmesh pods☆21Updated last month
- Automated Quality Control for Dialogflow CX Agents☆14Updated 8 months ago
- Open Source Model Risk Management☆21Updated 2 years ago
- ☆26Updated last month
- Examples showing use of NGC containers and models withing Amazon SageMaker☆17Updated 2 years ago
- MLFlow Deployment Plugin for Ray Serve☆43Updated 2 years ago
- Cluster Toolkit is an open-source software offered by Google Cloud which makes it easy for customers to deploy AI/ML and HPC environments…☆220Updated this week
- Provides for deploying custom ETL containers on AIStore, with subsequent user-defined extraction-transformation-loading in parallel, on t…☆15Updated this week
- struct2tensor is a library for parsing and manipulating structured data inside of tensorflow.☆34Updated last month
- ☆14Updated 2 years ago
- ☆34Updated 3 weeks ago