NascentCore / 3k
The 3k platform is for training LLMs
☆14Updated this week
Alternatives and similar repositories for 3k:
Users who are interested in 3k compare it to the libraries listed below
- An HPC and Cloud Computing Fused Job Scheduling System☆91Updated this week
- InfiniBand SR-IOV CNI☆12Updated 2 weeks ago
- RDMA CNI plugin for containerized workloads☆51Updated this week
- Ubuntu kernels which are optimized for NVIDIA server systems☆36Updated this week
- NVIDIA NCCL Tests for Distributed Training☆85Updated 2 weeks ago
- ☆60Updated this week
- ☆31Updated 3 years ago
- ☆42Updated 10 months ago
- Hooked CUDA-related dynamic libraries by using automated code generation tools.☆150Updated last year
- Prometheus exporter for an InfiniBand fabric☆59Updated last year
- ☆34Updated this week
- ☆14Updated 3 years ago
- The BeeGFS Container Storage Interface (CSI) driver provides high performing and scalable storage for workloads running in Kubernetes. 📦…☆67Updated 2 months ago
- The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.☆103Updated this week
- Device plugin for Volcano vGPU which supports hard resource isolation☆67Updated 2 weeks ago
- A general-purpose GPU monitor, which can monitor GPU cards and the usage of each pod or container.☆19Updated 3 years ago
- InfiniBand SR-IOV CNI☆46Updated 2 weeks ago
- ☆132Updated 3 years ago
- This repository provides installation scripts and configuration files for deploying the CSGHub instance, including Helm charts and Docker…☆13Updated this week
- MIG Partition Editor for NVIDIA GPUs☆192Updated last week
- A diverse, simple, and secure all-in-one LLMOps platform☆101Updated 6 months ago
- ☆13Updated last month
- HAMi-core compiles libvgpu.so, which ensures hard limits on GPU usage in containers☆146Updated 3 weeks ago
- An I/O benchmark for deep learning applications☆82Updated this week
- Kubernetes RDMA SR-IOV device plugin☆110Updated 4 years ago
- Transparent checkpoint/restart library for CUDA applications.☆12Updated 10 years ago
- HTTP-based, tree-shaped peer-to-peer blob transfer proxy for distributing images or blob data.☆20Updated 2 years ago
- Testing if I can implement Slurm in an operator☆14Updated 4 months ago
- The API (CRD) of Volcano☆37Updated this week
- ☆522Updated 9 months ago