SergioMEV / slurm-for-dummiesLinks
A dummy's guide to setting up (and using) HPC clusters on Ubuntu 22.04LTS using Slurm and Munge. Created by the Quant Club @ UIowa.
☆378Updated last year
Alternatives and similar repositories for slurm-for-dummies
Users that are interested in slurm-for-dummies are comparing it to the libraries listed below
Sorting:
- Instructions for setting up a SLURM cluster using Ubuntu 18.04.3 with GPUs.☆153Updated this week
- A Slurm cluster using docker-compose☆409Updated last week
- TUI for the Slurm Workload Manager☆224Updated 2 months ago
- My tools for the Slurm HPC workload manager☆554Updated last month
- Container plugin for Slurm Workload Manager☆396Updated this week
- Jobstats is a job monitoring platform for CPU and GPU clusters☆108Updated 3 weeks ago
- Python Interface to Slurm☆551Updated last week
- ☆360Updated 2 months ago
- Steps to create a small slurm cluster with GPU enabled nodes☆271Updated 2 years ago
- A Slurm dashboard for the terminal.☆99Updated last year
- Where GPUs get cooked 👩🍳🔥☆310Updated 2 months ago
- JAX-Toolbox☆359Updated this week
- How to use Singularity!☆68Updated 5 years ago
- A repository of definition files for bootstrapping Singularity containers around the software applications, frameworks, and libraries you…☆63Updated 2 weeks ago
- Open source web interface for Slurm HPC & AI clusters☆508Updated this week
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆306Updated 2 weeks ago
- ☆61Updated 2 years ago
- A simple Python wrapper for Slurm with flexibility in mind.☆159Updated 6 months ago
- ☆177Updated last year
- ☆100Updated last year
- Repository of machine learning benchmarks☆45Updated this week
- Package for extracting and mapping the results of every single tensor operation in a PyTorch model in one line of code.☆629Updated last month
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs☆683Updated last week
- Material for the SC22 Deep Learning at Scale Tutorial☆41Updated 2 years ago
- Jupyter notebooks, jobscripts and other files for the "Getting started with AI on LUMI" workshop☆42Updated last month
- Ansible role for installing and managing the Slurm Workload Manager☆111Updated 7 months ago
- NVIDIA Math Libraries for the Python Ecosystem☆532Updated 2 months ago
- This material contains content on how to profile and optimize simple Pytorch mnist code using NVIDIA Nsight Systems and Pytorch Profiler☆20Updated 2 years ago
- ☆232Updated this week
- Tutorial for installing Open XDMoD, OnDemand, & ColdFront☆159Updated 5 months ago