NascentCore / 3kLinks
Orchestrating many small GPU clusters for running serverless GPU workloads
☆15Updated 6 months ago
Alternatives and similar repositories for 3k
Users that are interested in 3k are comparing it to the libraries listed below
Sorting:
- A distributed scheduling system for HPC and AI workloads☆120Updated this week
- Cloyster HPC is a turnkey HPC cluster solution with an user-friendly installer☆10Updated last month
- InfiniBand SR-IOV CNI☆14Updated last week
- NVIDIA NCCL Tests for Distributed Training☆118Updated this week
- A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management.☆24Updated last week
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆86Updated last year
- A diverse, simple, and secure all-in-one LLMOps platform☆109Updated last year
- A toolkit for discovering cluster network topology.☆74Updated this week
- This repository provides installation scripts and configuration files for deploying the CSGHub instance, includes Helm charts and Docker…☆16Updated last week
- Terraform provider for BaiduCloud☆24Updated this week
- RDMA CNI plugin for containerized workloads☆58Updated this week
- Golang bindings for Nvidia Datacenter GPU Manager (DCGM)☆137Updated last week
- A Slurm cluster for Kubernetes☆65Updated last year
- 配合 HAI Platform 使用的集成化用户界面☆53Updated 2 years ago
- Prometheus exporter for a Infiniband Fabric☆67Updated last year
- 国产加速卡-海光DCU实战(大模型训练、微调、推理 等)☆52Updated 2 months ago
- The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.☆41Updated last week
- InfiniBand SR-IOV CNI☆54Updated this week
- ☆68Updated this week
- Device-plugin for volcano vgpu which support hard resource isolation☆119Updated last month
- Intelligent platform for AI workloads☆37Updated 2 years ago
- KJob: Tool for CLI-loving ML researchers☆39Updated this week
- Offline optimization of your disaggregated Dynamo graph☆88Updated this week
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…☆24Updated 10 months ago
- Bitfusion with Kubernetes Integration Support