NascentCore / 3kLinks
Orchestrating many small GPU clusters for running serverless GPU workloads
☆14Updated 5 months ago
Alternatives and similar repositories for 3k
Users that are interested in 3k are comparing it to the libraries listed below
Sorting:
- InfiniBand SR-IOV CNI☆14Updated last week
- ☆65Updated this week
- Public repository for the BeeGFS Parallel File System☆165Updated 3 months ago
- RDMA CNI plugin for containerized workloads☆57Updated this week
- InfiniBand SR-IOV CNI☆54Updated this week
- The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.☆38Updated this week
- A Slurm cluster for Kubernetes☆65Updated last year
- A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management.☆25Updated this week
- Golang bindings for Nvidia Datacenter GPU Manager (DCGM)☆134Updated last week
- ☆20Updated last year
- NVIDIA NCCL Tests for Distributed Training☆112Updated 2 weeks ago
- Health checks for Azure N- and H-series VMs.☆52Updated this week
- Intelligent platform for AI workloads☆37Updated 2 years ago
- Prometheus exporter for a Infiniband Fabric☆67Updated last year
- Bitfusion with Kubernetes Integration Support☆50Updated last year
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆86Updated last year
- Run Slurm on Kubernetes. A Slinky project.☆173Updated this week
- GPU Stress Test is a tool to stress the compute engine of NVIDIA Tesla GPU’s by running a BLAS matrix multiply using different data types…☆110Updated 3 months ago
- The BeeGFS Container Storage Interface (CSI) driver provides high performing and scalable storage for workloads running in Kubernetes. 📦…☆71Updated this week
- ☆43Updated last year
- Terraform provider for BaiduCloud☆24Updated last month
- IP Over Infiniband (IPoIB) CNI Plugin☆15Updated this week
- An HPC and Cloud Computing Fused Job Scheduling System☆118Updated last week
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…☆24Updated 10 months ago
- ☆28Updated last year
- ☆39Updated this week
- Ubuntu kernels which are optimized for NVIDIA server systems☆62Updated this week
- IPAM plugin for kubernetes☆28Updated this week
- GPUd automates monitoring, diagnostics, and issue identification for GPUs☆437Updated last week
- Testing if I can implement slurm in an operator☆15Updated 11 months ago