Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.
☆127Apr 7, 2026Updated this week
Alternatives and similar repositories for gpu-recipes
Users that are interested in gpu-recipes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆172Apr 1, 2026Updated last week
- This repository is a collection of accelerated platform best practices, reference architectures, example use cases, reference implementat…☆85Updated this week
- Cluster Toolkit is an open-source software offered by Google Cloud which makes it easy for customers to deploy AI/ML and HPC environments…☆329Apr 3, 2026Updated last week
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆420Jan 5, 2026Updated 3 months ago
- ☆64Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆12May 30, 2025Updated 10 months ago
- GCP PCI-DSS 3.2.1 InSpec Profile☆18May 26, 2021Updated 4 years ago
- ☆16Mar 13, 2025Updated last year
- AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kub…☆327Jun 23, 2025Updated 9 months ago
- TPU inference for vLLM, with unified JAX and PyTorch support.☆284Updated this week
- CUDA Template Functions☆20Dec 16, 2025Updated 3 months ago
- Automated Quality Control for Dialogflow CX Agents☆14May 3, 2024Updated last year
- Make tool-calling schemas for existing tools☆14Mar 8, 2025Updated last year
- ☆49Jan 5, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- AI/ML Recipes for Vertex AI, Serverless Spark and BigQuery open-source project is an effort to jumpstart your development of data process…☆71Mar 31, 2026Updated last week
- ☆321Apr 2, 2026Updated last week
- Repository for go shared libraries (for now).☆11Dec 1, 2025Updated 4 months ago
- Android has APKs, Docker has OCIs - Edge now has Edge Containers☆11Mar 30, 2026Updated last week
- This repository contains the results and code for the MLPerf™ Inference v4.0 benchmark.☆11Jul 24, 2025Updated 8 months ago
- A Template for MLOps on Google Cloud Vertex AI☆13Mar 16, 2022Updated 4 years ago
- Collection of OSS models that are containerized into a serving container☆16Sep 19, 2023Updated 2 years ago
- ☆11Sep 6, 2024Updated last year
- ☆45Updated this week
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆13Oct 12, 2020Updated 5 years ago
- Optimized primitives for collective multi-GPU communication☆10May 8, 2024Updated last year
- ☆13Dec 6, 2022Updated 3 years ago
- DGXC Benchmarking provides recipes in ready-to-use templates for evaluating performance of specific AI use cases across hardware and soft…☆79Updated this week
- A collection of Python agent samples built with the Google Agent Development Kit (ADK), demonstrating integrations with services like B…☆15Apr 1, 2026Updated last week
- A Helm Chart with pre-configured tools for your Container Engine clusters☆19Jan 5, 2018Updated 8 years ago
- ☆10Dec 19, 2017Updated 8 years ago
- ☆10Mar 23, 2023Updated 3 years ago
- WG Serving☆34Mar 24, 2026Updated 2 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Machine learning (ML) inference on Fastly's Compute@Edge☆16Jun 11, 2024Updated last year
- Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)☆65Mar 11, 2026Updated 3 weeks ago
- Pubsub2Inbox is a versatile, multi-purpose tool to handle Pub/Sub messages and turn them into email, API calls, GCS objects, files or alm…☆46Mar 29, 2026Updated last week
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆79Dec 18, 2025Updated 3 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆29Mar 1, 2025Updated last year
- A simple, performant and scalable Jax LLM!☆2,201Updated this week
- This repo hosts code for vLLM CI & Performance Benchmark infrastructure.☆33Updated this week