nateGeorge / slurm_gpu_ubuntu
Instructions for setting up a SLURM cluster using Ubuntu 18.04.3 with GPUs.
☆152 Updated 4 years ago
Alternatives and similar repositories for slurm_gpu_ubuntu
Users who are interested in slurm_gpu_ubuntu are comparing it to the repositories listed below.
- Steps to create a small Slurm cluster with GPU-enabled nodes ☆270 Updated 2 years ago
- My tools for the Slurm HPC workload manager ☆527 Updated last week
- A Slurm cluster using docker-compose ☆384 Updated 3 weeks ago
- A dummy's guide to setting up (and using) HPC clusters on Ubuntu 22.04 LTS using Slurm and Munge. Created by the Quant Club @ UIowa. ☆332 Updated last year
- Container plugin for Slurm Workload Manager ☆369 Updated this week
- Python Interface to Slurm ☆532 Updated 3 weeks ago
- A Slurm dashboard for the terminal. ☆88 Updated last year
- ☆59 Updated 2 years ago
- Scheduling GPU cluster workloads with Slurm ☆75 Updated 6 years ago
- Slurm in Docker - Exploring Slurm using CentOS 7 based Docker images ☆129 Updated 5 years ago
- Jobstats is a job monitoring platform for CPU and GPU clusters ☆81 Updated last week
- Ansible role for installing and managing the Slurm Workload Manager ☆107 Updated 4 months ago
- Instructions for setting up a Slurm GPU cluster on Ubuntu 22.04. ☆27 Updated last year
- Benchmark Suite for Deep Learning ☆272 Updated 5 months ago
- SLURM Example Scripts ☆71 Updated 5 years ago
- Open source web interface for Slurm HPC & AI clusters ☆467 Updated 2 weeks ago
- How to Configure a GPU Cluster Running Ubuntu Linux ☆59 Updated 8 years ago
- HPC Container Maker ☆489 Updated 3 weeks ago
- Slurm-Mail is a drop-in replacement for Slurm's e-mails to give users much more information about their jobs compared to the standard Slu… ☆111 Updated this week
- A simple command line tool to show GPU usage on a SLURM cluster ☆110 Updated last year
- Provide Python access to the NVML library for GPU diagnostics ☆243 Updated 8 months ago
- Tutorial for using Singularity containers ☆117 Updated 4 years ago
- Prometheus exporter for performance metrics from Slurm. ☆256 Updated last year
- Distributed K-FAC preconditioner for PyTorch ☆89 Updated this week
- A logging tool for deep learning. ☆60 Updated 4 months ago
- Material for the SC21 Deep Learning at Scale Tutorial ☆26 Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Training v0.7 benchmark. ☆57 Updated 2 years ago
- ☆100 Updated 10 months ago
- Slurm on Google Cloud Platform ☆189 Updated 10 months ago
- A GPU performance profiling tool for PyTorch models ☆503 Updated 4 years ago