kubeflow / mpi-operator
Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)
☆487 Updated last week
Alternatives and similar repositories for mpi-operator
Users who are interested in mpi-operator are comparing it to the repositories listed below
- GPU plugin to the node feature discovery for Kubernetes ☆302 Updated last year
- Run your deep learning workloads on Kubernetes more easily and efficiently. ☆526 Updated last year
- Device plugins for Volcano, e.g. GPU ☆126 Updated 4 months ago
- Common APIs and libraries shared by other Kubeflow operator repositories. ☆52 Updated 2 years ago
- ☆283 Updated last week
- elastic-gpu-scheduler is a Kubernetes scheduler extender for GPU resources scheduling. ☆142 Updated 2 years ago
- NVIDIA DRA Driver for GPUs ☆402 Updated this week
- Fork of NVIDIA device plugin for Kubernetes with support for shared GPUs by declaring GPUs multiple times ☆88 Updated 3 years ago
- Kubernetes Operator for AI and Bigdata Elastic Training ☆87 Updated 6 months ago
- GPU Sharing Device Plugin for Kubernetes Cluster ☆487 Updated 2 years ago
- A CLI for Kubeflow. ☆786 Updated last week
- MIG Partition Editor for NVIDIA GPUs ☆207 Updated this week
- A Kubernetes plugin that enables dynamically adding or removing GPU resources for a running Pod ☆127 Updated 3 years ago
- Share GPU between Pods in Kubernetes ☆211 Updated 2 years ago
- AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads ☆205 Updated last year
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively. ☆201 Updated 3 years ago
- A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC ☆1,091 Updated 2 years ago
- ☆119 Updated 2 years ago
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication ☆529 Updated this week
- ☆132 Updated 4 years ago
- Go Abstraction for Allocating NVIDIA GPUs with Custom Policies ☆116 Updated last month
- Automatic tuning for ML model deployment on Kubernetes ☆80 Updated 9 months ago
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆246 Updated this week
- HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container ☆195 Updated this week
- Holistic job manager on Kubernetes ☆117 Updated last year
- NVIDIA Network Operator ☆268 Updated last week
- ☆533 Updated last year
- Controller for ModelMesh ☆239 Updated last month
- Docker for Your ML/DL Models Based on OCI Artifacts ☆471 Updated last year
- RDMA device plugin for Kubernetes ☆217 Updated last year