albanie / slurm_gpustat
A simple command line tool to show GPU usage on a SLURM cluster
☆107 Updated 11 months ago
Alternatives and similar repositories for slurm_gpustat:
Users who are interested in slurm_gpustat are comparing it to the repositories listed below.
- Website-based resource monitor for Slurm system ☆35 Updated last year
- A Machine Learning workflow for Slurm. ☆149 Updated 4 years ago
- yaspi - Yet Another Slurm Python Interface ☆44 Updated 2 years ago
- An unopinionated replacement for PyTorch's Dataset and ImageFolder, that handles Tar archives ☆76 Updated 2 years ago
- Example of how to use Weights & Biases on Slurm ☆113 Updated 2 years ago
- A convenient way to trigger synchronizations to wandb / Weights & Biases if your compute nodes don't have internet! ☆72 Updated 3 weeks ago
- ☆117 Updated 2 months ago
- gpu tester detects broken and slow gpus in a cluster ☆68 Updated 2 years ago
- Natural Language Descriptions of Deep Visual Features, ICLR 2022 ☆62 Updated last year
- ☆81 Updated 7 months ago
- Beyond Straight-Through ☆94 Updated last year
- Understanding the Difficulty of Training Transformers ☆44 Updated 2 years ago
- [NeurIPS 2021] Code for Unsupervised Learning of Compositional Energy Concepts ☆59 Updated 2 years ago
- A curated list of techniques to avoid posterior collapse ☆87 Updated 2 years ago
- MaskedTensors for PyTorch ☆39 Updated 2 years ago
- This is the CUDA GPU implementation + Python interface (using PyTorch) of DCI. The paper can be found at https://arxiv.org/abs/1512.00442… ☆12 Updated last year
- Framework code with wandb, checkpointing, logging, configs, experimental protocols. Useful for fine-tuning models or training from scratc… ☆149 Updated 2 years ago
- ☆201 Updated 2 years ago
- A Domain-Agnostic Benchmark for Self-Supervised Learning ☆107 Updated last year
- Implementation of Discrete Key / Value Bottleneck, in Pytorch ☆87 Updated last year
- Efficient reservoir sampling implementation for PyTorch ☆107 Updated 3 years ago
- Joint Academic Data Science Endeavour (JADE) is the largest GPU facility in the UK supporting world-leading research in machine learning … ☆24 Updated last week
- ☆164 Updated 2 years ago
- Collaborative documentation for and from Jean Zay users. Official Jean Zay documentation: http://www.idris.fr/eng/jean-zay/ ☆118 Updated 8 months ago
- ☆64 Updated last year
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s… ☆67 Updated 2 years ago
- This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes … ☆84 Updated 2 years ago
- Differentiable Top-k Classification Learning ☆80 Updated 2 years ago
- Code for paper "Argmax Flows and Multinomial Diffusion: Learning Categorical Distributions" ☆85 Updated 3 years ago
- CUDA kernels for generalized matrix-multiplication in PyTorch ☆79 Updated 3 years ago