This repo includes everything you need to know about deploying GPU nodes on OCI
☆54 · Jun 12, 2026 · Updated this week
Alternatives and similar repositories for oci-hpc-oke
Users who are interested in oci-hpc-oke are comparing it to the libraries listed below.
- Terraform examples for deploying HPC clusters on OCI ☆66 · May 26, 2026 · Updated 2 weeks ago
- NVIDIA NCCL Tests for Distributed Training ☆146 · Jun 2, 2026 · Updated last week
- Go client for StackPath APIs ☆11 · Jul 18, 2025 · Updated 10 months ago
- InfiniBand fabric monitoring daemon written in Go ☆32 · May 22, 2025 · Updated last year
- Optimized primitives for collective multi-GPU communication ☆11 · May 8, 2024 · Updated 2 years ago
- A toolkit for discovering cluster network topology. ☆132 · Updated this week
- Kubeflow on OCI ☆10 · Jul 27, 2022 · Updated 3 years ago
- ocifs provides a POSIX-compatible API wrapping Oracle Cloud Infrastructure's (OCI) Object Storage. ocifs is a python library that relies … ☆22 · Nov 6, 2025 · Updated 7 months ago
- This repository contains the results and code for the MLPerf™ Training v4.0 benchmark. ☆12 · Jun 11, 2024 · Updated 2 years ago
- RPerf: Accurate Latency Measurement Framework for RDMA ☆15 · Apr 14, 2026 · Updated 2 months ago
- Gateway Sidecar Opensource Repository: The slice VPN Gateway is a slice network service component that provides a secure VPN tunnel betwe… ☆20 · Dec 30, 2025 · Updated 5 months ago
- PyTorch code examples for measuring the performance of collective communication calls in AI workloads ☆21 · Sep 18, 2025 · Updated 8 months ago
- A small POC using Caddy as a TLS-terminating MQTT proxy ☆12 · Aug 31, 2022 · Updated 3 years ago
- JSON Logging for Sanic ☆10 · Sep 1, 2021 · Updated 4 years ago
- Build a Slurm Cluster using SaltStack in virtual machines ☆12 · Nov 26, 2018 · Updated 7 years ago
- Benchmarking guide for the Azure AI Infrastructure. ☆41 · Updated this week
- This creates a simple LiveISO customised for your use ☆14 · Dec 24, 2019 · Updated 6 years ago
- 🛤 Ansible setup for building a WireGuard reverse proxy server ☆14 · Feb 26, 2020 · Updated 6 years ago
- Docker XPRA HTML5 Image with OpenGL support for NVIDIA cards ☆13 · Oct 28, 2020 · Updated 5 years ago
- kubeslice-cli: Repository for maintaining code of kubeslice cli utility ☆25 · Oct 7, 2025 · Updated 8 months ago
- The official documentation for Kubeslice project ☆30 · Feb 12, 2026 · Updated 4 months ago
- SAML Authentication Plugin for Caddy v2 ☆16 · Sep 25, 2020 · Updated 5 years ago
- Run Slurm in Kubernetes ☆389 · Updated this week
- Intel Management Engine JTAG Proof of Concept - 2022 Instructions ☆32 · Sep 4, 2022 · Updated 3 years ago
- /j f t/ - YAML file tool ☆14 · Apr 28, 2026 · Updated last month
- nv-one-logger enables tracking of GPU application progress over time and can help to identify overhead from workload and cluster ineffici… ☆23 · Nov 6, 2025 · Updated 7 months ago
- Example Achilles SDK controller for tutorial purposes. ☆14 · Jun 18, 2025 · Updated 11 months ago
- Research Computing Framework Based on Singularity and Lmod ☆10 · Aug 22, 2020 · Updated 5 years ago
- ITIX's Custom CoreOS build ☆10 · Nov 24, 2020 · Updated 5 years ago
- Singularity recipes for OpenFOAM ☆12 · Jan 2, 2022 · Updated 4 years ago
- Examples using different CI systems to build with Earthly. ☆12 · Oct 1, 2021 · Updated 4 years ago
- A command line utility to manage the configuration of a system's high performance network interfaces for RoCE deployments ☆36 · Jul 25, 2023 · Updated 2 years ago
- Cluster doctor skills ☆14 · May 23, 2026 · Updated 3 weeks ago
- Terraform module to deploy Redis on Oracle Cloud Infrastructure (OCI) ☆13 · Aug 21, 2025 · Updated 9 months ago
- NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated compu… ☆315 · Updated this week
- Operator that tracks SelinuxPolicy objects in certain namespaces. ☆12 · Jun 26, 2022 · Updated 3 years ago
- Pulumi provider for Vultr ☆26 · Feb 1, 2026 · Updated 4 months ago
- 📡 Deploy AI models and apps to Kubernetes without developing a hernia ☆33 · May 23, 2024 · Updated 2 years ago
- Authenticating HTTP(S) proxy with TCP/IP tunneling and acceleration; mirror of http://svn.awk.cz/cntlm ☆15 · Jan 21, 2013 · Updated 13 years ago