GoogleCloudPlatform / nvidia-nemo-on-gkeLinks
Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine
☆13Updated last month
Alternatives and similar repositories for nvidia-nemo-on-gke
Users that are interested in nvidia-nemo-on-gke are comparing it to the libraries listed below
Sorting:
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆67Updated this week
- Volume Controller for Kubernetes☆67Updated 2 years ago
- ☆43Updated 4 months ago
- Test infrastructure and tooling for Kubeflow.☆62Updated 3 months ago
- Blueprints for Deploying Kubeflow on Google Cloud Platform and Anthos☆82Updated last year
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub☆37Updated 7 years ago
- [DEPRECATED] GAE python based app which regularly collects information about GCP resources and stores them in BigQuery☆45Updated last year
- ☆76Updated last week
- ML Pipeline Generator is a tool for generating end-to-end pipelines composed of GCP components so that any customer can easily migrate th…☆50Updated 3 years ago
- GCP extensions for Jupyter and JupyterLab☆56Updated 2 months ago
- Repository used to main group ACLs used by Kubeflow developers☆18Updated this week
- Amazon SageMaker operator for Kubernetes☆149Updated last year
- Collection of Knative demos☆68Updated 2 years ago
- Terraform module for creating GKE clusters to run Kubeflow☆215Updated 4 years ago
- Secure HDFS Access from Kubernetes☆61Updated 4 years ago
- AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kub…☆318Updated last week
- Deep learning benchmark utility and optimization tips on EKS.☆48Updated 5 years ago
- ☆33Updated 6 years ago
- ☆34Updated last month
- GenAI inference performance benchmarking tool☆45Updated this week
- Ephemeral Hadoop clusters using Google Compute Platform☆135Updated 3 years ago
- Seldon Core Operator for Kubernetes☆12Updated 5 years ago
- This repository guides you through deploying a private GKE cluster and provides a base platform for hands-on exploration of several GKE r…☆53Updated 5 years ago
- ☆15Updated 4 years ago
- ☆47Updated last year
- Integration between knative and certmanager for managing TLS certs automatically.☆22Updated last year
- This is the shared project for two Kubernetes Engine demos☆18Updated last year
- 👩🔬[Experimental] Easily train and serve ML models on Kubernetes, directly from your python code.☆31Updated 6 years ago
- ☆14Updated 2 years ago
- Deploying EFA in EKS utilizing GPUDirectRDMA where supported☆37Updated 7 months ago