PKUHPC / OpenSCOWLinks
Super Computing On Web
☆293Updated last week
Alternatives and similar repositories for OpenSCOW
Users that are interested in OpenSCOW are comparing it to the libraries listed below
Sorting:
- An HPC and Cloud Computing Fused Job Scheduling System☆111Updated this week
- Front end code of Crane☆30Updated last week
- ☆15Updated last month
- SJTU HPC 用户文档站点☆177Updated 2 months ago
- OpenAIOS vGPU device plugin for Kubernetes is originated from the OpenAIOS project to virtualize GPU device memory, in order to allow app…☆570Updated last year
- Metastack: an enhanced and performance optimized version of Slurm☆53Updated last month
- Slurm in Docker - Exploring Slurm using CentOS 7 based Docker images☆129Updated 5 years ago
- Open source web interface for Slurm HPC & AI clusters☆477Updated last month
- Open OnDemand Application Collection for SJTU HPC☆13Updated 4 years ago
- ☆535Updated last year
- 高性能计算系统性能评价工具集☆21Updated last year
- Prometheus exporter for a Infiniband Fabric☆66Updated last year
- 一种任务级GPU算力分时调度的高性能深度学习训练平台☆692Updated last year
- Supercomputing. Seamlessly. Open, Interactive HPC Via the Web☆374Updated this week
- A shim driver allows in-docker nvidia-smi showing correct process list without modify anything☆93Updated last month
- ☆67Updated 7 months ago
- ☆277Updated 2 years ago
- Lustre Monitoring System based on Collectd, Grafana and Influxdb☆45Updated last year
- SJTU HPC 开源项目:Spackenv (Spack ENVironment) switch environments between sysadmin, users and developers.☆22Updated 3 years ago
- Cluster/HPC installation for diskless compute nodes☆46Updated 2 months ago
- A Slurm cluster using docker-compose☆390Updated last month
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs☆565Updated 2 weeks ago
- torch_musa is an open source repository based on PyTorch, which can make full use of the super computing power of MooreThreads graphics c…☆434Updated 2 weeks ago
- ☆883Updated last year
- NVIDIA NCCL Tests for Distributed Training☆110Updated this week
- Build NCCL-Tests and configure SSHD in PyTorch container to help you test NCCL faster!☆12Updated this week
- run DeepSeek-R1 GGUFs on KTransformers☆250Updated 5 months ago
- 收录SC小组在学习高性能计算、分布式架构、数据挖掘与人工智能方向的笔记和材料☆13Updated 3 years ago
- Prometheus exporter for performance metrics from Slurm.☆261Updated last year
- LBNL Node Health Check☆256Updated 4 months ago